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U.S. APPLICATION NO. (If known, see 37 CFR 1.5) 



10/030529 



INTERNATIONAL APPLICATION NO. 
PCT/US00/18834 



i INTERNATIONAL FILING DATE 
I July 7, 2000 



PRIORITY DATE CLAIMED 
July 9, 1999 



TITLE OF INVENTION 

DSRA PROTEIN AND POLYNUCLEOTIDES ENCODING THE SAME 



APPLICANT(S) FOR DO/EO/US 
CHRISTOPHER ELKINS 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 

1. £3 This is a FIRST submission of items concerning a filing under 35 U.S.C. 371. 

2. □ This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371. 

3. 3 This is an express request to begin national examination procedures (35 U.S.C. 371(f)). The submission must include 

items (5), (6), (9) and (21) indicated below. 

4. The US has been elected by the expiration of 19 months from the priority date (Article 31). 
5^ S A copy of the International Application as filed (35 U.S.C. 371(c)(2)) 

0 a. [X] is attached hereto (required only if not communicated by the International Bureau), 
b. O has been communicated by the International Bureau. 

JiJ c. □ is not required, as the application was filed in the United States Receiving Office (RO/US). 
€H □ An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)). 

1 y a. Q is attached hereto. 

y b. □ has been previously submitted under 35 U.S.C. 154(d)(4) 

•Q Amendments to the claims of the International Application Under PCT Article 19 (35 U.S.C. 371(c)(3)) 
HI a. □ are attached hereto (required only if not communicated by the International Bureau), 
if b. □ have been communicated by the International Bureau. 

f| c. □ have not been made; however, the time limit for making such amendments has NOT expired. 

fU d. 13 have not been made and will not be made. 

8. □ An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)). 

9. □ An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). 

10. □ An English language translation of the annexes of the International Preliminary Examination Report Under PCT 

Article 36 (35 U.S.C. 371(c)(5)). 



Items 11 to 20 below concern document(s) or information included: 

11. □ An Information Disclosure Statement under 37 CFR 1 .97 and 1 .98 . 

12. □ An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.3 1 is included. 

13. 3 A FIRST preliminary amendment. 

14. □ A SECOND or SUBSEQUENT preliminary amendment. 

15. □ A substitute specification. 

16. □ A change of power of attorney and/or address letter. 

17. A computer-readable form of the sequence listing in accordance with PCT Rule 13ter.2 and 35 U.S.C. 1.821 - 1.825. 

1 8. □ A second copy of the published international application under 35 U.S.C. 1 54(d)(4). 

19. □ A second copy of the English language translation of the international application under 35 U.S.C. 154(d)(4) 

20. [3 Other items or information: Demand, International Preliminary Examination Report, International Search Report, 



r ^ - o 9 2002 



U.S. APPLICATipN.NO. fif fc»?wfesee ( 37 I 



21. □ The following fees are submitted: 
BASIC NATIONAL FEE (37 CFR 1.492(a) (1) - (5)): 

Neither international preliminary examination fee (37 CFR 1.482) 
nor international search fee (37 CFR 1.445(a)(2)) paid to USPTO 
and International Search Report not prepared by the EPO or JPO. . . . 



International preliminary examination fee (37 CFR 1 .482) not paid to 
USPTO but International Search Report prepared by the EPO or JPO. . . 



CALCULATIONS 



PTO USE ONLY 



$1040.00 
. $890.00 



International preliminary examination fee 37 CFR 1.482) not paid to USPTO 

but international search fee (37 CFR 1 .445(a)(2)) paid to USPTO $740.00 

International preliminary examination fee (37 CFR 1.482) paid to USPTO 

but all claims did not satisfy provisions of PCT Article 33(1 )-(4) $710.00 

International preliminary examination fee (37 CFR 1.482) paid to USPTO 

and all claims satisfied provisions of PCT Article 33(l)-(4) $100.00 

ENTER APPROPRIATE BASE FEE AMOUNT = 



rcharge of $130.00 for furnishing the oath or declaration later than □ 20 □ 30 months 
m the earliest claimed priority date (37 CFR 1.492(e)). 



NUMBER FILED NUMBER EXTRA RATE 



tr 



jCLILTIPLE DEPENDENT CLAIM(S) (if applicable) 



TOTAL OF ABOVE CALCULATIONS = 



Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above are 
reduced by 1/2 



SUBTOTAL = 



grjbcessing fee of $130.00 for furnishing the English translation later than □ 20 □ 30 
laenths from the earliest claimed priority date (37 CFR 1.492(f)). 



TOTAL NATIONAL FEE = 



tge for Recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
j&pompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property 



TOTAL FEES ENCLOSED = 



a. □ A check in the amount of $ to cover the above fees is enclosed. 

b. K Please charge my Deposit Account No. 50-0220 in the amount of$72L00 to cover the above fees. A duplicate copy of 

this sheet is enclosed. 

c. [X] The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any overpayment 

to Deposit Account No. 50-0220. A duplicate copy of this sheet is enclosed. 

d. □ Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card 

information should not be included on this form. Provide credit card information and authorization on PTO-2038. 

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 hps not been met, a politic 
must be filed and granted to restore the application to pending status. 

SEND ALL CORRESPONDENCE TO: 



iBiuwiiiiiiiwiw 

20792 

PATENT TRADEMARK OFFICE 




[7 CFR 1.137(a) or (b)) 



CERTIFICATE OF EXPRESS MAILING 

Express Mail Label No. EV015665109US 
Date of Deposit: January 9, 2002 

I hereby certify that this correspondence is being deposited with the United 
States Postal Service "Express Mail Post Office to Addressee" service under 37 CFR 
1.10 on the date indicated above and is addressed to: BOX PCT, Attn: DO/EOAJS, 
Commissioner for Patents, Washington, DC 20231. 



Morilca L. Croom 
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RAW SEQUENCE LISTING DATE: 02/08/2002 

PATENT APPLICATION: US/10/030,529 TIME: 10:11:16 

Input Set : A:\PTO.VSK.txt 

Output Set: N:\CRF3\02082002\J030 529.raw ENTERED 

3 <110> APPLICANT: University of North Carolina-Chapel Hill 

4 Elkins, Christopher 

u 6 <120> TITLE OP INVENTION: Isolated Polynucleotides Encoding DsrA, A Protein Conferring 
SerufiT" 

^7 Resistance To H. ducreyi, And Methods And Compositions Comprising The Same 

Cl9 <130> FILE REFERENCE: 5470-269. WO 

C-->W.l <140> CURRENT APPLICATION NUMBER: US/10/030,529 

C--JO-1 <141> CURRENT FILING DATE: 2002-01-09 

mi <160> NUMBER OF SEQ ID NOS : 18 

Sj.3 <170> SOFTWARE: Patentln version 3.1 

jjjj.5 <210> SEQ ID NO: 1 

Jl6 <211> LENGTH: 116 8 

Li 7 <212> TYPE: DNA 

3.8 <213> ORGANISM: Haemophilus ducreyi 
<4 00> SEQUENCE: 1 

M&l ataaatacgt cattgacatt tttttaatgt aaggtagaat aagaaagtaa attctatatt 60 

U23 tacaatcaag attgacaatt atttacttaa tgaggtgatt atgaaaatta aatgtttagt 120 

ip25 tgccgtagtg ggattagctt gttctactat tacaacaatg gctcagcagc cgccaaagtt 180 

|1j 7 tgctggagta tcttctttgt atagctatga gtatgactat ggtaagggta aatggacttg 24 0 

29 gtctaatgaa ggcggtttcg atattaaagt gccagggatt aaaatgaagc caaaagaatg 300 

31 gatttctaaa caggctactt atcttgaatt acagcattat atgccttata ctcctgttct 360 

33 cgtgacatat gctcctggcg tttctcctag ccctatactg ttatatccga tgtctgatcc 420 

35 tgatcaactt ggaataaatc ggcagcagct gaaattgaat ttgtatagtt attttaacga 480 

37 tttaagacac gattttaaat taaaagttct tgatgcacgt atttccaaaa ataaacaaaa 540 

39 tattgatact ataagtaaat atttactaga actgggtact tatttagatg attcttatcg 6 00 

41 tatgatggaa caaaatacac ataatatcaa taagttgtct aaagaattgc aaactggttt 660 

43 agccaaccaa tcagcattgt ctatgttagt gcaaccaaat ggtgtaggca aaacgagcgt 720 

45 ttctgctgcg gtaggaggtt atagagataa aactgcatta gccattggtg tcggctcacg 780 

47 cattactgat cgctttaccg ctaaagcggg tgtagcgttc aatacctaca atggcggcat 840 

49 gtcttatggt gcttctgttg gttatgaatt ctaatcatta cgtttaatca ctaatcgttt 900 

51 tggttataat aaaaaggcta aatgtttctc ctcacattta gcctttctta tttatctttg 960 

53 ttatagcttt tgctgttata aaaccgtttt ttagccactt ttattaatta agcttttaag 102 0 

55 cctattcaat cagttctact ttcacttttt tcaccatatt atccgccact tctaaaacgg 10 80 

57 taatattaag ttggtttagc ctaaattggg taccttctat cggaattttt tctaaatgtt 114 0 

59 ctaaaattaa gccgttaaag gtgcggac 1168 

62 <210> SEQ ID NO: 2 

63 <211> LENGTH: 257 

64 <212> TYPE: PRT 

65 <213> ORGANISM: Haemophilus ducreyi 
67 <400> SEQUENCE: 2 

69 Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 

70 1 5 10 15 

73 He Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 

74 20 25 30 
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RAW SEQUENCE LISTING DATE: 02/08/2002 

PATENT APPLICATION: US/10/030,529 TIME: 10:11:16 

Input Set : A:\PTO.VSK.txt 

Output Set: N:\CRF3\02082002\J030529.raw 



77 Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 

78 35 40 45 

81 Asn Glu Gly Gly Phe Asp lie Lys Val Pro Gly lie Lys Met Lys Pro 

82 50 55 60 

85 Lys Glu Trp lie Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 

86 65 70 75 80 

89 Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala Pro Gly Val Ser Pro 

90 85 90 95 

93 Ser Pro He Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly He 

94 100 105 110 

97 Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 
:JB 115 120 125 

1=401 Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg He Ser Lys Asn 
y.02 130 135 140 

J=405 Lys Gin Asn He Asp Thr He Ser Lys Tyr Leu Leu Glu Leu Gly Thr 

y.06 145 150 155 160 

Q.09 Tyr Leu Asp Asp Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn He 

UllO 165 170 175 

fdl 3 Asn Lys Leu Ser Lys Glu Leu Gin Thr Gly Leu Ala Asn Gin Ser Ala 

#14 180 185 190 

s 117 Leu Ser Met Leu Val Gin Pro Asn Gly Val Gly Lys Thr Ser Val Ser 

?J.18 195 200 205 

5*3.21 Ala Ala Val Gly Gly Tyr Arg Asp Lys Thr Ala Leu Ala He Gly Val 

3.22 210 215 220 

Jg.25 Gly Ser Arg He Thr Asp Arg Phe Thr Ala Lys Ala Gly Val Ala Phe 
Jfj.26 225 230 235 240 

M.29 Asn Thr Tyr Asn Gly Gly Met Ser Tyr Gly Ala Ser Val Gly Tyr Glu 
11.30 245 250 255 

13 3 Phe 

137 <210> SEQ ID NO: 3 

138 <211> LENGTH: 1205 

139 <212> TYPE: DNA 

140 <213> ORGANISM: Haemophilus ducreyi 
142 <4 00> SEQUENCE: 3 

14 3 attttataat ttacaataca ttttatattt ttatattata taaatacgtc attgacattt 6 0 
14 5 ttttaaggta gaataagaaa gtaaattcta tatttacaat caagattgac aattatttac 120 
14 7 ttaatgaggt gattatgaaa attaaatgtt tagttgccgt agtgggatta gcttgttcta 180 
149 ctattacaac aatggctcag eagccgccaa agtttgctgg agtatcttct ttgtatagct 240 
151 atgagtatga ctatggtaag ggtaaatgga cttggtctaa tgaaggcggt ttcgatatta 300 
153 aagtgccagg gattaaaatg aagccaaaag aatggatttc taaacaggct acttatcttg 360 
155 aattacagca ttatatgcct tatactcctg ttctcgtgac atatgctcat gacgttcctc 420 
157 ctagctctat actgttatat ccgatgtctg atcctgatca acttggaata aatcggcagc 480 
159 agctgaaatt gaatttgtat agttatttta acgatttaag acacgatttt aaattaaaag 540 
161 ttcttgatgc acgtatttcc aaaaataaac aaaatattga tactataagt aaatatttac 600 
163 tagaactggg tacttattta gatgattctt atcgtatgat ggaacaaaat acacataata 660 
165 tcaataaaaa tacacataat atcaataagt tgtctaaaga attgcaaact ggtttagcca 720 
167 accaatcagc attgtctatg ttagtgcaac caaatggtgt aggcaaaacg agcgtttctg 780 
169 ctgcggtagg aggttataga gataaaactg cattagccat tggtgtcggc tcacgcatta 840 
171 ctgatcgctt taccgctaaa gcgggtgtag cgttcaatac ctacaatggc ggcatgtctt 900 
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PATENT APPLICATION: US/10/030,529 TIME: 10:11:16 

Input Set : A:\PTO.VSK.txt 
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173 


atggtgcttc tgttggttat gaattctaat cattacgttt 


aatcactaat cgttttggtt 


960 


175 


ataataaaaa ggctaaatgt ttctcctcac atttagcctt 


tcttatttat ctttgttata 


1020 


177 


gcttttgctg ttataaaacc gttttttagc 


; cacttttatt 


aattaagctt ttaagcctat 


1080 


179 


tcaatcagtt ctactttcac ttttttcacc atattatccg 


ccacttctaa aacggtaata 


1140 


181 


ttaagttggt ttagcctaaa ttgggtacct tctatcggaa 


ttttttctaa atgttctaaa 


1200 


183 


attaa 






























1205 


186 


<210> SEQ ID NO: 


: 4 


























187 


<211> LENGTH: 264 


























188 


<212> TYPE: 


PRT 




























189 


<213> ORGANISM: 


Haemophilus 


ducreyi 


















191 


<4 00> SEQUENCE: 


4 


























193 


Met 


Lys 


He 


Lys 


Cys 


Leu 


Val 


Ala 


Val 


Val 


Gly 


Leu 


Ala 


Cys 


Ser 


Thr 




Ml94 


1 








5 










10 










15 






Q-97 


He 


Thr 


Thr 


Met 


Ala 


Gin 


Gin 


Pro 


Pro 


Lys 


Phe 


Ala 


Gly 


Val 


Ser 


Ser 




QL9 8 








20 










25 










30 








h'201 


Leu 


Tyr 


Ser 


Tyr 


Glu 


yr 


sp 


Tyr 


Gly 


Lys 


Gly 


Lys 


Trp 


Thr 


Trp 


Ser 




-2 C 2 






35 










40 










45 










' 2 0 5 


Asn 


Glu 


Gly 


Gly 


Phe 


Asr> 


He 


Lys 


Val 


Pro 


Gly 


He 


Lys 


Met 


Lys 


Pro 




20c 




50 










55 










60 












;t 0 3 


Lys 


Glu 


Trp 


He 


Ser 


Lys 


Gin 


Ala 


Thr 


T 


Leu 


Glu 


Leu 


Gin 


His 


Tyr 




*210 


65 










70 










75 










80 




:_2i3 


Met 


Pro 


Tyr 


Thr 


Pro 


Val 


Leu 


Val 


Thr 


Tyr 


Ala 


His 


Asp 


Val 


Pro 


Pro 




=2 14 










85 










90 










95 






"217 


Ser 


Ser 


He 


Leu 




Tvr 


Pro 


Met 


Ser 


Asp 


Pro 


Asp 


Gin 


Leu 


Gly 


He 




.1218 








100 










105 










110 








m 2 i 


Asn 


Arg 


Gin 


Gin 


Leu 


ys 


Leu 


Asn 




Tyr 


Ser 


Tyr 


Phe 




Asp 


Leu 




- 222 






115 










120 










125 












Arg 


His 


Asp 


Phe 


Lys 


Leu 


Lys 


Val 


Leu 


Asp 


Ala 


Arg 


He 


Ser 


Lys 


Asn 




'^26 




130 










135 










140 












229 


Lys 


Gin 


Asn 


He 


Asp 


Thr 


He 


Ser 


Lys 


Tvr 


Leu 




Glu 


Leu 


Gly 


Thr 




230 


145 










150 










155 










160 




233 


Tyr 


Leu 


Asp 


Asp 


Ser 


Tyr 


Arg 


Met 


Met 


Glu 


Gin 


Asn 


Thr 


His 


Asn 


He 




234 










165 










170 










175 






237 


Asn 


Lys 


Asn 


Thr 


His 


Asn 


He 


Asn 


Lys 


Leu 


Ser 


Lys 


Glu 


Leu 


Gin 


Thr 




238 








180 










185 










190 








241 


Gly 


Leu 


Ala 


Asn 


Gin 


Ser 


Ala 


Leu 


Ser 


Met 


Leu 


Val 


Gin 


Pro 


Asn 


Gly 




242 






195 










200 










205 










245 


Val 


Gly 


Lys 


Thr 


Ser 


Val 


Ser 


Ala 


Ala 


Val 


Gly 


Gly 


Tyr 


Arg 


Asp 


Lys 




246 




210 










215 










220 












249 


Thr 


Ala 


Leu 


Ala 


He 


Gly 


Val 


Gly 


Ser 


Arg 


He 


Thr 


Asp 


Arg 


Phe 


Thr 




250 


225 










230 










235 










240 




253 


Ala 


Lys 


Ala 


Gly 


Val 


Ala 


Phe 


Asn 


Thr 


Tyr 


Asn 


Gly 


Gly 


Met 


Ser 


Tyr 




254 










245 










250 










255 






257 


Gly 


Ala 


Ser 


Val 


Gly 


Tyr 


Glu 


Phe 




















258 








260 




























261 


<210> SEQ ID NO: 


: 5 


























262 


<211> LENGTH: 952 


























263 


<212> TYPE: 


DNA 




























264 


<213> ORGANISM: 


Haemophilus 


ducreyi 
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RAW SEQUENCE LISTING DATE: 02/08/2002 

PATENT APPLICATION: US/10/0 30,529 TIME: 10:11:16 

Input Set : A:\PTO.VSK.txt 

Output Set: N:\CRF3\02082002\J030529.raw 

266 <400> SEQUENCE: 5 

267 attttataat ttacaataca ttttatattt ttatattata taaatacgtc attgacattt 60 
269 ttttaaggta gaataagaaa gtaaattcta tatttacaat caagattgac aattatttac 120 
271 ttaatgaggt gattatgaaa attaaatgtt tagttgccgt agtgggatta gcttgttcta 180 
273 ctattacaac aatggctcag cagccgccaa agtttgctgg agtatcttct ttgtatagct 24 0 
275 atgagtatga ctatggtaag ggtaaatgga cttggtctaa tgaaggcggt ttcgatatta 300 
277 aagtgccagg gattaaaatg aagccaaaag aatggatttc taaacaggct acttatcttg 360 
2 79 aattacagca ttatatgcct tatactcctg ttctcgtgac atatgctcat gacgttcctc 420 
281 ctagctctat actgttatat ccgatgtctg atcctgatca acttggaata aatcggcagc 480 
283 agctgaaatt gaatttgtat agttatttta acgatttaag acacgatttt aaattaaaag 540 
285 ttcttgatgc acgtatttcc aaaaataaac aaaatattga tactataagt aaatatttac 600 

_ 287 tagaactggg tacttattta gatgattctt atcgtatgat ggaacaaaat acacataata 660 
H289 tcaataaaaa tacacataat atcaataagt tgtctaaaga attgcaaact ggtttagcca 720 
==291 accaatcagc attgtctatg ttagtgcaac caaatggtgt aggcaaaacg agcgtttctg 780 
G293 ctgcggtagg aggttataga gataaaactg cattagccat tggtgtcggc tcacgcatta 840 
U.J295 ctgatcgctt taccgctaaa gcgggtgtag cgttcaatac ctacaatggc ggcatgtctt 900 
p297 atggtgcttc tgttggttat gaattctaat cattacgttt aatcactaat eg 952 
fJ1300 <210> SEQ ID NO: 6 
p|301 <211> LENGTH: 264 
[f?02 <212> TYPE: PRT 

"303 <213> ORGANISM: Haemophilus ducreyi 
L305 <4 00> SEQUENCE: 6 

hl3 07 Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 
yj308 1 5 10 15 

Gil lie Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 
01312 20 25 30 

£315 Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 
f|316 35 40 45 

319 Asn Glu Gly Gly Phe Asp lie Lys Val Pro Gly lie Lys Met Lys Pro 

320 50 55 60 

32 3 Lys Glu Trp lie Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 
324 65 70 75 80 

327 Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala His Asp Val Pro Pro 

328 85 90 95 

331 Ser Ser lie Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly lie 

332 100 105 110 

33 5 Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 
336 115 120 125 

339 Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg He Ser Lys Asn 

340 130 135 140 

34 3 Lys Gin Asn He Asp Thr He Ser Lys Tyr Leu Leu Glu Leu Gly Thr 
344 145 150 155 160 

347 Tyr Leu Asp Asp Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn He 

348 165 170 175 

351 Asn Lys Asn Thr His Asn He Asn Lys Leu Ser Lys Glu Leu Gin Thr 

352 180 185 190 

355 Gly Leu Ala Asn Gin Ser Ala Leu Ser Met Leu Val Gin Pro Asn Gly 

356 195 200 205 

359 Val Gly Lys Thr Ser Val Ser Ala Ala Val Gly Gly Tyr Arg Asp Lys 
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RAW SEQUENCE LISTING DATE: 02/08/2002 

PATENT APPLICATION: US/10/0 30,529 TIME: 10:11:16 

Input Set : A:\PTO.VSK.txt 

Output Set: N:\CRF3\02082002\J030529.raw 

360 210 215 220 

363 Thr Ala Leu Ala lie Gly Val Gly Ser Arg lie Thr Asp Arg Phe Thr 

364 225 230 235 240 

367 Ala Lys Ala Gly Val Ala Phe Asn Thr Tyr Asn Gly Gly Met Ser Tyr 

368 245 250 255 

371 Gly Ala Ser Val Gly Tyr Glu Phe 

372 260 

375 <210> SEQ ID NO : 7 

376 <211> LENGTH: 899 

377 <212> TYPE: DNA 

378 <213> ORGANISM: Haemophilus ducreyi 
380 <400> SEQUENCE: 7 

y381 ttttataatt tacaatacat tttatatttt tatattatat aaatacgtca ttgacatttt 6 0 

p383 tttaatgtaa ggtagaataa gaaagtaaat tctatattta caatcaagat tgacaattat 12 0 
^385 ttacttaatg aggtgattat gaaaattaaa tgtttagttg ccgtagtggg attagcttgt 180 
7^87 tctactatta caacaatggc tcagcagccg ccaaagtttg ctggagtatc ttctttgtat 240 
S 89 agctatgagt atgactatgg taagggtaaa tggacttggt ctaatgaagg cggtttcgat 300 
=391 attaaagtgc cagggattaa aatgaagcca aaagaatgga tttctaaaca ggctacttat 360 
^B93 cttgaattac agcattatat gccttatact cctgttctcg tgacatatgc tcctggcgtt 420 
iTA95 tctcctagcc ctatactgtt atatccgatg tctgatcctg atcaacttgg aataaatcgg 480 
yQ397 cagcagctga aattgaattt gtatagttat tttaacgatt taagacacga ttttaaatta 540 
;s 399 aaagttcttg atgcacgtat ttccaaaaat aaacaaaata ttgatactat aagtaaatat 600 
S01 ttactagaac tgggtactta tttagatgat tcttatcgta tgatggaaca aaatacacat 660 
Uf03 aatatcaata agttgtctaa agaattgcaa actggtttag ccaaccaatc agcattgtct 720 
rt 05 atgttagtgc aaccaaatgg tgtaggcaaa acgagcgttt ctgctgcggt aggaggttat 780 
JS 07 agagataaaa ctgcattagc cattggtgtc ggctcacgca ttactgatcg ctttaccgct 840 
J|09 aaagcgggtg tagcgttcaa taccttctat cggaattttt tctaaatgtt ctaaaatta 899 
!rfl2 <210> SEQ ID NO: 8 
!l 4l3 <211> LENGTH: 242 

414 <212> TYPE: PRT 

415 <213> ORGANISM: Haemophilus ducreyi 
417 <400> SEQUENCE: 8 

419 Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 

420 15 10 15 

42 3 He Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 
424 20 25 30 

42 7 Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 
428 35 40 45 

431 Asn Glu Gly Gly Phe Asp He Lys Val Pro Gly He Lys Met Lys Pro 

432 50 55 60 

435 Lys Glu Trp He Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 

436 65 70 75 80 

439 Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala Pro Gly Val Ser Pro 

440 85 90 95 

44 3 Ser Pro He Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly He 

444 100 105 110 

44 7 Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 

448 115 120 125 

451 Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg He Ser Lys Asn 
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Attorney's Docket No. 5470-269 PATENT 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

ATTN: DO/EO/US 

In re: Christopher Elkins 
Serial No.: to be assigned 
Filed: concurrently herewith 

For: DSRA PROTEIN AND POLYNUCLEOTIDES ENCODING THE SAME 

Date: January 9, 2002 

Box PCT 

Commissioner for Patents 
Washington, DC 20231 

PRELIMINARY AMENDMENT 

Dear Sirs: 

Prior to the examination of the above application, please amend the above-identified 
application as follows. A "Version with Markings to Show Changes Made" is attached at 
page 3. 

In the Specification: 

At page 1, following the title, please insert the following: 

-This application claims priority to PCT Application number PCT/US00/18834 filed 
in English on July 7, 2000 claming priority from U.S. Provisional Patent Application number 
60/143,257 filed on July 9, 1999, the disclosures of which are hereby incorporated herein by 
reference in their entirety. — 

In the Claims: 

Please amend Claim 28 as follows: 

28. (Amended) A method for inducing a protective immune response in a subject at 
risk of developing H. ducreyi infection comprising administering to the subject a vaccine 
according to Claim 24 in an amount sufficient to induce an immune response. 



In re: Christopher Elkins 
Serial No.: to be assigned 
Filed: concurrently herewith 
Page 2 of 3 



REMARKS 

The above amendments are made to claim priority to the above-identified patent 
applications. Claim 28 has been amended to conform to U.S. practice. 
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Respectfully submitted, 




. Michael "Smckland 
Registration No. 47,1 15 



1,1111 lll'llll 

20792 



CERTIFICATE OF EXPRESS MAILING 
Express Mail Label No. EV015665109US 
Date of Deposit: January 9, 2002 

I hereby certify that this correspondence is being deposited with the United States Postal Service "Express Mail 
Post Office to Addressee" service under 37 CFR 1.10 on the date indicated above and is addressed to: BOX PCT, 
Attn: DO/EO/US, Commissioner for Patents, Washington, DC 2023 1 . 



Monica L. Croom 



In re: Christopher Elkins 
Serial No.: to be assigned 
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Version with Markings to Show Changes Made 



28. (Amended) A method for inducing a protective immune response in a subject at 
risk of developing H. ducreyi infection comprising administering to the subject a vaccine 
according to [one of Claims 20-27] Claim 24 in an amount sufficient to induce an immune 
response. 
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Isolated Polynucleotides Encoding DsrA, A Protein 
Conferring Serum Resistance To H. ducreyi, 
And Methods And Compositions Comprising The Same 

Statement of Federal Support 

This invention was made with United States Government support under 
grant numbers AI 40263 and A126837 from the National Institutes of Health 
(Public Health Service). The United States Government has certain rights to this 
invention. 

Field of The Invention 

This invention relates to proteins that are involved in the serum resistance 
of H. ducreyi. 

Background of the Invention 

Haemophilus ducreyi is the etiologic agent of chancroid, a genital ulcer 
disease transmitted by sexual contact. See, e.g., Albritton, W. L., Microbiol Rev. 
53:377-89 (1989); Trees, D. L., and S. A. Morse, Clin Microbiol Rev. 8, 357-375 
(1 995). Chancroid has gained importance recently because it has been implicated 
as an independent risk factor for the heterosexual transmission of HIV in Africa. 
See Albritton, supra, Trees, supra; R.M. Greenblattet et al., AIDS 2, 47-50 (1988); 
Jessamine, P. G., and A. R. Ronald, Med Clin North Am. 74, 1417-31 (1990); 
Plummer, F. A. et al., J Infect Dis. 161, 810-1 (1990); D. L., and S. A. Morse, 
Clin. Microbiol Rev. 8, 357-375 (1995); Wasserheit, J. N., Sex Trans Dis. 19, 61- 
77 (1991). 

Serum resistance has been shown in numerous bacterial systems to be 
critical for the survival of invading bacterial and the establishment of disease, since 
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mutations which result in the loss of serum resistance renders several bacterial 
pathogens avirulent. See. e.g., Blaser, M. J., American Journal of the Medical 
Sciences. 306, 325-9 (1993); Corbeil, L. B., Canadian Journal of Veterinary 
Research. 54,S57-62 (1990); Mobley, H. L. et al., Kidney International - 
Supplement. Al, S 129-36 (1 994); Rice, P. A., Clinical Microbiology Reviews. 2, 
SI 12-7 (1989); and Stall, T. L., and J. J. LiPuma, Medical Clinics of North 
America. 75, 287-9 (1991). In most systems, the serum resistance phenotype is the 
product of multiple genes. H. ducreyi is resistant to high levels of normal human 
serum (NHS; up to 50%). Early studies on H. ducreyi serum resistance by 
Odumeru and colleagues concluded that truncation of LOS in several strains was 
associated with avirulence and loss of serum resistance (see Odumeru, J. A., et al , 



Infect. Immun. 43, 607-61 1 (1984); Odumeru, J. A. et al., Infect. Immun. 50, 495-9 
(1985); Odumeru, J. A. et al., J Med Microbiol. 23, 155-62 (1987)), whereas a 



W 

i 

nj 

m recent study came to the opposite conclusion. See Hiltke, T. J. et al, Microb 

L Path. 26,93-102(1999) 

Originally described as a cell spreading factor, vitronectin is now 
(J! recognized as a multifunctional regulatory adhesive glycoprotein involved in a 

variety of extracellular processes such as the attachment and spreading of normal 
and neoplastic cells, as well as the function of the complement and coagulation 
pathways. Integrins are transmembrane a& heterodimer receptors expressed on a 
wide variety of cells which are involved in extracellular matrix interactions. The 
ligands for several of the integrins are adhesive extracellular matrix (ECM) 
proteins such as fibronectin, vitronectin, collagens and laminin. 

Proteins or fragments thereof that are able to interfere with vitronectin 
binding to various integrins and to block integrin-mediated cell attachment to 
extracellular matrix proteins are useful in preventing the attachment of the bacteria 
to the host organism, and thus infection of the host. 

The ability to use a protein or antibody that interferes with vitronectin 
binding in a vaccine against H. ducreyi is desirable. These kinds of proteins are 
believed to be highly conserved among strains of a particular type of bacteria in 
that they are the protein molecules that mediate attachment by bonding bacteria to 
host cells, the initial step in the infection process. A vaccine against H. ducreyi 
comprising a protein or antibody that would interfere with vitronectin binding 
-2- 
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would be effective against a broad array of types and strains ofN. ducreyi. The use 
of such a vaccine may prevent adherence of the bacteria to the tissue of the host 
animal. In that adherence is one of the initial step in H. ducreyi infection, 
accordingly, preventing or limiting the infection at this point would be 
advantageous. 

In view of the foregoing, it would be desirable to determine the mechanism 
of serum resistance in H. ducreyi. Additionally, the development of an effective 
vaccine against H. ducreyi would be advantageous. 

Summary of the Invention 

Certain objects, advantages and novel features of the invention will be set 
forth in the description that follows, and will become apparent to those skilled in 
the art upon examination of the following, or may be learned with the practice of 
the invention. 

The present invention is based in the inventor's discovery that a protein, 
referred to herein as DsrA, has been found to play a critical role in the resistance of 
H. ducreyi to normal human serum. 

Accordingly, one aspect of the invention is a polynucleotide (e.g., DNA) 
that encodes the protein DsrA. Particularly preferred is the DNA ofSEQ ID 
NO:l, which encodes the protein DsrA set forth in SEQ ID NO:2. 

An additional aspect of the invention is the isolated protein DsrA, which 
protein may vary in molecular weight between 28 and 35 kilodaltons, depending 
on whether the particular DsrA protein sequence comprises one, two or three 
copies of the amino acid heptamer NTHNINK. 

Expression vectors and host cells expressing DsrA are also an aspect of the 
invention. Antibodies against DsrA and antisense molecules of DsrA are a further 
aspect of the present invention. 

Vaccines against H. ducreyi comprising proteins, polynucleotides and 
expression vectors of DsrA are a further aspect of the invention. 

Also an aspect of this invention is an isogenic mutant (FX5 1 7) ofH. 
ducreyi strain 35000 that does not express DsrA, which mutant finds use in an 
attenuated vaccine against H. ducreyi. 
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The foregoing and other aspects of the present invention are explained in 
detail in the specification set forth below. 

Brief Description of the Drawings 

FIG. 1 is a photograph of Western Blot illustrating the distribution of the 
DsrA protein and summary of serum resistance of H. ducreyi strains. Total cellular 
proteins from geographically diverse H. ducreyi strains were subjected to SDS- 
PAGE and Western blotting using anti-DsrA mouse sera. Bound antibody was 
detected with alkaline phosphatase-conjugated secondary antibody and BCIP/NBT 
substrate. An additional twelve H. ducreyi strains also expressed a 28-35 kDa 
protein which reacted with this serum (data not shown). The names of strains are 
indicated above each lane. Shown to the left of the gel are molecular weight 
standards, where the abbreviation kDa means kilodaltons. R refers to resistant to 
50% NHS; S, sensitive to 50% NHS; an asterisk indicates that resistance to NHS 
was indeterminate. The data in FIG. 1 are compiled from experiments done on at 
least three separate days. 

FIG. 2 is a schematic illustration of the restriction map of the dsrA region 
and PCR products thereof. The dsrA open reading frame is boxed. The restriction 
sites are indicated. The numbered arrows indicate direction and position of the 
dsrA oligos used for PCR. The letter KS and T7 (promoter) refer to the vector 
primers used in the vector-anchored PCR reactions. V-A PCR refers to vector- 
anchored PCR; P refers to a promoter. The jagged lines represent approximately 2 
kb of sequence not shown downstream of the dsrA locus. 

FIG. 3 sets forth the DNA sequence (SEQ ID NO:l) and deduced amino 
acid sequence (SEQ ID NO: 2) of the dsrA locus. The putative -35 and -10 
promoter sequences are indicated and underlined. A putative ribosome binding 
site is labeled RBS and underlined. Twenty one amino acids comprising the signal 
peptide are underlined. The stop codon TAA is indicated with an asterisk. The 
opposing arrows show a potential stem loop transcription terminator. 

FIG. 4 sets forth a comparison of the amino acid sequence of DsrA with 
the UspA2 protein of M. catarrahalis and the YadA protein of Y. enterocolitica. 
Shaded, boxed residues indicate homologous sequences. 
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FIG. 5 is a photograph of a SDS-PAGE/Western blot of parent strain 
35000 and dsrA mutant FX5 1 7. Outer membranes were prepared, solubilized at 
37°C or 100°C and subjected to SDS-PAGE and Coomassie staining (Panel A). 
For the Western blot (panel B), outer membranes were solubilized at 100°C, 
transferred to nitrocellulose and probed with anti-DsrA mouse serum. Bound 
antibody was detected with alkaline phosphatase-conjugated secondary antibody 
and BCIP/NBT substrate. The asterices indicate the positions of the DsrA protein. 
STD, molecular weight standards. 

FIG. 6 is a graphical illustration of the bactericidal killing of parent strain 
3 5000 compared with the bactericidal killing of the dsrA mutant FX5 1 7. 
O Bactericidal killing was performed as described in FIG. 1, except that two serum 

w 

O concentrations were utilized. The data presented in FIGS. 1 and 6 for 35000 with 

= f\ 

Sj 50% sera are the same data- The data presented for 35000 were obtained in 

^0 parallel experiments with FX5 1 7. 

□ FIG. 7 is a photograph of a SDS-PAGE/Western Blot illustrating 

_"" ! Complementation of dsrA mutants. Total cellular proteins from the indicated H. 

g] ducreyi strains were subjected to SDS-PAGE (12%) and Western blotting using 

jy anti-DsrA antisera. Bound antibody was detected with horseradish peroxidase- 

conjugated secondary antibody followed by chemiluminescence. "N" indicates no 
plasmid present; "+" indicates pUNCH 1260 (i.e., contains the entire dsrA ORF 
from strain 35000 and its putative native promoter as illustrated in Fig. 2); 
"-"indicates pLSKS a vector without insert. Below each strain are shown the 
summary of bactericidal killing of the complemented dsrA mutants. Bactericidal 
killing was performed as in FIG. 1 (50% serum), except that the medium used 
contained streptomycin. 

FIG. 8 is a photograph of an SDS-PAGE gel illustrating the analysis of 
LOS as described in Example 16, below. Crude LOS was prepared as described in 
the text and subjected to SDS-PAGE and silver staining. 

FIG. 9 illustrates a comparison of the deduced amino acid sequences of 
dsrA from eight additional H. ducreyi strains. Variable regions 1 and 2 are 
indicated. 
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FIG. 10 illustrates promoter region mutations in the strains CIP A75 and 
CIP All, which do not express dsrA. The 5 base-pair deletions present in strains 
CIP A75 and CIP All are shown as hyphens. 

FIG. 11 is a graphical illustration showing that efficient attachment of if. 
ducreyi to a keratinocyte cell line requires DsrA expression. H. ducreyi were 
added to HaCaT cells at a MOI of between 1-5:1 and incubated for two hours. 
After removal of unbound bacteria by extensive washing, CFUs were determined 
by plating the disrupted monolayer. The data shown in FIG. 11 are taken from 
four experiments. 

FIG. 12 is an autoradiograph of an SDS-PAGE illustrating the affinity 
purification of DsrA from whole cells using biotinylated vitronectins (Vn). 
Biotinylated vitronectins were mixed with surface-iodinated H. ducreyi and 
allowed to bind. After washing unbound vitronectin by centrifugation and washing, 
H. ducreyi were solubilized with a gentle detergent. Total soluble H. ducreyi 
proteins were bound to solid-phase streptavidin-agarose. After washing the 
streptavidin agarose, bound proteins were eluted by boiling in sample buffer and 
analysis by SDS-PAGE and autoradiography. 

Detailed Description of the Preferred Embodiments 

The present invention will now be described more fully hereinafter with 
reference to the accompanying figures, in which preferred embodiments of the 
invention are shown. This invention may, however, be embodied in different 
forms and should not be construed as limited to the embodiments set forth herein. 
Rather, these embodiments are provided so that this disclosure will be thorough 
and complete, and will fully convey the scope of the invention to those skilled in 
the art. 

Unless otherwise defined, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the art 
to which this invention belongs. All publications, patent applications, patents, and 
other references mentioned herein are incorporated by reference in their entirety. 

Amino acid sequences disclosed herein are presented in the amino to 
carboxy direction, from left to right. The amino and carboxy groups are not 
presented in the sequence. Nucleotide sequences are presented herein by single 
-6- 
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strand only, in the 5' to 3' direction, from left to right. Nucleotides and amino 
acids are represented herein in the manner recommended by the IUPAC-IUB 
Biochemical Nomenclature Commission, or (for amino acids) by three letter code, 
in accordance with 37 CFR § 1.822 and established usage. See, e.g., Patent In User 
Manual, 99-102 (Nov. 1990) (U.S. Patent and Trademark Office). 

Dsr A is an H. ducreyi outer membrane protein required for the expression 
of serum resistance and is encoded by the gene dsrA, described herein. The 
isolated K ducreyi protein DsrA, and the isolated polynucleotides that encode the 
protein, are aspects of the present invention. The DsrA protein in its monomer 
form varies in molecular weight between 28 and 35kDA between different H. 
ducreyi strains in SDS-PAGE and Western blots. The dsrA locus from several H. 
ducreyi strains was sequenced and the deduced amino acid sequences were greater 
than 85% identical. The major difference between the different strains is found in 
the amino acid sequence, in which either one, two or three copies of the amino acid 
NTHNINK in the VR2 region of the protein present; these repeats account for the 
variability in the monomer form of the DsrA observed in SDS-PAGE. DsrA 
proteins that contain one, two or three copies of the NTHNINK in the VR2 region 
of the protein, and accordingly having a molecular weight of between 28 and 35 
kilodaltons, are all within the scope of the present invention. Additionally, DsrA, 
as used herein, refers to the amino acid sequences of substantially purified DsrA 
obtained from any species, particularly mammalian, including bovine, ovine, 
porcine, murine, equine, and preferably human, from any source whether natural, 
synthetic, semi-synthetic, or recombinant. 

As used herein, in this context, the term "isolated" means that the protein is 
significantly free of other proteins. That is, a composition comprising the isolated 
protein is between 70% and 94% pure by weight. Preferably, the protein is 
purified. As used herein, the term "purified" and related terms means that the 
protein is at least 95% pure by weight, preferably at least 98% pure by weight, and 
most preferably at least 99% pure by weight. 

An "allele" as used herein, is an alternative form of the polynucleotides 
(i.e., genes) encoding DsrA. Alleles may result from at least one mutation in the 
nucleic acid sequence and may result in altered mRNAs or polypeptides whose 
structure or function may or may not be altered. Any given natural or recombinant 
-7- 
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gene may have none, one, or many allelic forms. Common mutational changes 
which give rise to alleles are generally ascribed to natural deletions, additions, or 
substitutions of nucleotides. Each of these types of changes may occur alone, or in 
combination with the others, one or more times in a given sequence. 

"Amino acid sequence," as used herein, refers to an oligopeptide, peptide, 
polypeptide, or protein sequence, and fragment thereof, and to naturally occurring 
or synthetic molecules. Fragments of DsrA are preferably and retain the biological 
activity or the immunological activity of DsrA. Where "amino acid sequence" is 
recited herein to refer to an amino acid sequence of a naturally occurring protein 
JZ molecule, amino acid sequence, and like terms, are not meant to limit the amino 

O acid sequence to the complete, native amino acid sequence associated with the 

ft recited protein molecule. 

4 : " "Amplification", as used herein, refers to the production of additional 

sB copies of a nucleic acid sequence and is generally carried out using polymerase 

p chain reaction (PCR) technologies well known in the art (Dieffenbach, C. W. and 

g G. S. Dveksler, PCR Primer, a Laboratory Manual, Cold Spring Harbor Press, 

| Plainview,N.Y.(1995)). 

pi As used herein, the term "antibody" refers to intact molecules as well as 

fragments thereof, such as Fa, F(ab')2, and Fc, which are capable of binding the 
DsrA protein or an antigenic or epitopic determinant thereof. Antibodies that bind 
DsrA polypeptides can be prepared using intact polypeptides or fragments 
containing small peptides of interest as an immunizing antigen. The polypeptide or 
oligopeptide may be used to immunize an animal and can be derived from the 
translation of RNA or synthesized chemically and can be conjugated to a carrier 
protein, if desired. Commonly used carriers that are chemically coupled to peptides 
include bovine serum albumin and thyroglobulin, keyhole limpet hemocyanin. The 
coupled peptide is then used to immunize the animal (e.g., a mouse, a rat, or a 
rabbit). 

The term "antigenic determinant", as used herein, refers to that fragment of 
a molecule (i.e., an epitope) that makes contact with a particular antibody. When a 
protein or fragment of a protein is used to immunize a host animal, numerous 
regions of the protein may induce the production of antibodies which bind 
specifically to a given region or three-dimensional structure on the protein; these 
-8- 
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regions or structures are referred to as antigenic determinants. An antigenic 
determinant may compete with the intact antigen (i.e., the immunogen used to ' 
elicit the immune response) for binding to an antibody. 

The term "antisense", as used herein, refers to any composition containing 
nucleotide sequences which are complementary to a specific DNA or RNA 
sequence. The term "antisense strand" is used in reference to a nucleic acid strand 
that is complementary to the "sense" strand. Antisense molecules include peptide 
nucleic acids and may be produced by any method including synthesis or 
transcription. Once introduced into a cell, the complementary nucleotides combine 
with natural sequences produced by the cell to form duplexes and block either 
transcription or translation. The designation "negative" is sometimes used in 
reference to the antisense strand, and "positive" is sometimes used in reference to 
the sense strand. 

The terms "complementary" or "complementarity," as used herein, refer to 
the natural binding of polynucleotides under permissive salt and temperature 
conditions by base-pairing. For example, the sequence "A-G-T" binds to the 
complementary sequence "T-C-A". Complementarity between two single-stranded 
molecules may be "partial," in which only some of the nucleic acids bind, or it. 
may be complete when total complementarity exists between the single stranded 
molecules. The degree of complementarity between nucleic acid strands has 
significant effects on the efficiency and strength of hybridization between nucleic 
acid strands. 

A "deletion", as used herein, refers to a change in the amino acid or 
nucleotide sequence and results in the absence of one or more amino acid residues 
or nucleotides. 

The term "derivative", as used herein, refers to the chemical modification 
of a nucleic acid encoding or complementary to DsrA or the encoded DsrA. Such 
modifications include, for example, replacement of hydrogen by an alkyl, acyl, or 
amino group. A nucleic acid derivative encodes a polypeptide which retains the 
biological or immunological function of the natural molecule. A derivative 
polypeptide is one which is modified by glycosylation, pegylation, or any similar 
process which retains the biological or immunological function of the polypeptide 
from which it was derived. 



WO 01/04138 



PCT/US00/18834 



The term "homology", as used herein, refers to a degree of 
complementarity. There may be partial homology or complete homology (i.e., 
identity). A partially complementary sequence that at least partially inhibits an 
identical sequence from hybridizing to a target nucleic acid is referred to using the 
functional term "substantially homologous." The inhibition of hybridization of the 
completely complementary sequence to the target sequence may be examined 
using a hybridization assay (Southern or northern blot, solution hybridization and 
the like) under conditions of low stringency. A substantially homologous sequence 
or hybridization probe will compete for and inhibit the binding of a completely 
homologous sequence to the target sequence under conditions of low stringency. 

O This is not to say that conditions of low stringency are such that non-specific 

•kl 

p binding is permitted; low stringency conditions require that the binding of two 

i j sequences to one another be a specific (i.e., selective) interaction. The absence of 

J3 non-specific binding may be tested by the use of a second target sequence which 

p lacks even a partial degree of complementarity (e.g., less than about 3 0% identity). 

In the absence of non-specific binding, the probe will not hybridize to the second 
IP non-complementary target sequence. 

ry The term "hybridization", as used herein, refers to any process by which a 

strand of nucleic acid binds with a complementary strand through base pairing. 
The term "hybridization complex", as used herein, refers to a complex formed' 
between two nucleic acid sequences by virtue of the formation of hydrogen bonds 
between complementary G and C bases and between complementary A and T 
bases; these hydrogen bonds may be further stabilized by base stacking 
interactions. The two complementary nucleic acid sequences hydrogen bond in an 
antiparallel configuration. A hybridization complex may be formed in solution 
(e.g., Cot or Rot analysis) or between one nucleic acid sequence present in solution 
and another nucleic acid sequence immobilized on a solid support (e.g., paper, 
membranes, filters, chips, pins or glass slides, or any other appropriate substrate to 
which cells or their nucleic acids have been fixed). 

An "insertion" or "addition", as used herein, refers to a change in an amino 
acid or nucleotide sequence resulting in the addition of one or more amino acid 
residues or nucleotides, respectively, as compared to the naturally occurring 
molecule. 

-10- 
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"Nucleic acid sequence" as used herein refers to an oligonucleotide, 
nucleotide, or polynucleotide, and fragments thereof, and to DNA or RNA of 
genomic or synthetic origin which may be single- or double-stranded, and 
represent the sense or antisense strand. "Fragments" are those nucleic acid 
sequences which are greater than 60 nucleotides than in length, and most 
preferably includes fragments that are at least 100 nucleotides or at least 1000 
nucleotides, and at least 10,000 nucleotides in length. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least 
about 6 nucleotides to about 60 nucleotides, preferably about 15 to 30 nucleotides, 
1^ and more preferably about 20 to 25 nucleotides, which can be used in PCR 

O amplification or a hybridization assay, or a microarray. As used herein, 

□ oligonucleotide is substantially equivalent to the terms "amplimers", "primers", 

: "oligomers", and "probes", as commonly defined in the art. 

The term "sample", as used herein, is used in its broadest sense. A 
q biological sample suspected of containing nucleic acid encoding DsrA, or 

t: fragments thereof, or DsrA itself may comprise a bodily fluid, extract from a cell, 

chromosome, organelle, or membrane isolated from a cell, a cell, genomic DNA, 
m RNA, or cDNA (in solution or bound to a solid support, a tissue, a tissue print, and 

the like). 

The terms "stringent conditions" or "stringency", as used herein, refer to 
the conditions for hybridization as defined by the nucleic acid, salt, and 
temperature. These conditions are well known in the art and may be altered in 
order to identify or detect identical or related polynucleotide sequences. Numerous 
equivalent conditions comprising either low or high stringency depend on factors 
such as the length and nature of the sequence (DNA, RNA, base composition), 
nature of the target (DNA, RNA, base composition), milieu (in solution or 
immobilized on a solid substrate), concentration of salts and other components 
(e.g., formamide, dextran sulfate and/or polyethylene glycol), and temperature of 
the reactions (within a range from about 5° C. below the melting temperature of the 
probe to about 20° C. to 25° C. below the melting temperature). One or more 
factors may be varied to generate conditions of either low or high stringency 
different from, but equivalent to, the above listed conditions. 
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A "substitution", as used herein, refers to the replacement of one or more 
amino acids or nucleotides by different amino acids or nucleotides, respectively. 

"Transformation", as defined herein, describes a process by which 
exogenous DNA enters and changes a recipient cell. It may occur under natural or 
artificial conditions using various methods well known in the art. Transformation 
may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host cell. The method is selected based on the type 
of host cell being transformed and may include, but is not limited to, viral 
infection, electroporation, heat shock, lipofection, and particle bombardment. Such 
"transformed" cells include stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of 
the host chromosome. They also include ceils which transiently express the 
inserted DNA or RNA for limited periods of time. 

Polynucleotides of the present invention include those polynucleotides 
encoding for proteins homologous to, and having essentially the same biological 
properties as, the protein DsrA disclosed herein. Particularly preferred is the DNA 
disclosed herein as SEQ ID NO:l and encoding the protein DsrA given herein 
SEQ ID NO:2. This definition of polynucleotides of the present invention is 
intended to encompass natural allelic sequences thereof. Accordingly, other 
preferred embodiments of the present invention include the polynucleotides set 
forth herein as SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, 
SEQ ID NO:ll, SEQ ID NO:13, SEQ ED NO:15, and SEQ D3 NO:17, which 
polynucleotide sequences encode the protein sequences set forth as SEQ ID NO:4, 
SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID 
NO:14, SEQ ID NO:16, and SEQ ID NO:18, respectively. Isolated DNA or 
cloned genes of the present invention can be of any species of origin, including 
mouse, rat, rabbit, cat, porcine, and human, but are preferably of mammalian 
origin. Polynucleotides that hybridize to any one of the DNA disclosed herein as 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, 
SEQ ID NO:ll, SEQ ID NO:13, SEQ ID NO: 15, or SEQ ID NO:17 (or 
fragments or derivatives thereof which serve as hybridization probes as discussed 
below) and which code on expression for a protein of the present invention (e.g., a 
protein according to SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
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NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, or 

SEQ ID NO: 18) are also an aspect of the invention. Conditions which will permit 
other polynucleotides that code on expression for a protein of the present invention 
to hybridize to the any one of DNA of SEQ ID NO:l, SEQ ID NO:3, SEQ ID 
NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:13, SEQ ID 
NO:15, or SEQ ID NO:17 disclosed herein can be determined in accordance with 
known techniques. For example, hybridization of such sequences may be carried 
out under conditions of reduced stringency, medium stringency or even stringent 
conditions (e.g., conditions represented by a wash stringency of 35-40% 
Formamide with 5x Denhardt's solution, 0.5% SDS and lx SSPE at 37°C; 
conditions represented by a wash stringency of 40-45% Formamide with 5x 
Denhardt's solution, 0.5% SDS, and lx SSPE at 42°C; and conditions represented 
by a wash stringency of 50% Formamide with 5x Denhardt's solution, 0.5% SDS 
S and lx SSPE at 42°C, respectively) to any one of the DNA of SEQ ED NO:l, SEQ 

P ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ 

- ID NO:13, SEQ ID NO:15, or SEQ ID NO:17 disclosed herein in a standard 

yi hybridization assay. See, e.g., J. Sambrook et al., Molecular Cloning, A 

nj Laboratory Manual (2d Ed. 1 989) (Cold Spring Harbor Laboratory). In general, 

sequences which code for proteins of the present invention and which hybridize to 
any one of the DNA of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID 
NO:7, SEQ ED NO:9, SEQ ID NO:ll, SEQ ID NO:13, SEQ D> NO:15, or SEQ 
ID NO:17 disclosed herein will be at least 75% homologous, 85% homologous, 
and even 95% homologous or more with the any one of SEQ ID NO:l, SEQ D3 
NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID 
NO:13, SEQ ID NO:15, or SEQ H> NO:17 . Further, polynucleotides that code 
for proteins of the present invention, or polynucleotides that hybridize to any one 
of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, 
SEQ ID NO:ll, SEQ ID NO:13, SEQ ID NO:15, and SEQ ID NO:17, but 
which differ in codon sequence from any one of SEQ ID NO:l, SEQ ID NO:3, 
SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:13, 
SEQ ID NO:15, or SEQ ID NO:17 due to the degeneracy of the genetic code, are 
also an aspect of this invention. The degeneracy of the genetic code, which allows 
different nucleic acid sequences to code for the same protein or peptide, is well 
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known in the literature. See, e.g., U.S. Patent No. 4,757,006 to Toole et al. at Col. 
2, Table 1. 

Although nucleotide sequences which encode DsrA and its variants are 
preferably capable of hybridizing to the nucleotide sequence of the naturally 
occurring DsrA under appropriately selected conditions of stringency, it may be 
advantageous to produce nucleotide sequences encoding DsrA or its derivatives 
possessing a substantially different codon usage. Codons may be selected to 
increase the rate at which expression of the peptide occurs in a particular 
prokaryotic or eukaryotic host in accordance with the frequency with which 

C particular codons are utilized by the host. Other reasons for substantially altering 

the nucleotide sequence encoding DsrA and its derivatives without altering the 

g encoded amino acid sequences include the production of RNA transcripts having 

m oie desirable properties, such as a greater half-life, than transcripts produced 

€= from the naturally occurring sequence. 

O 7116 invention also encompasses production of DNA sequences, or 

fragments thereof, which encode DsrA and its derivatives, entirely by synthetic 
chemistry. After production, the synthetic sequence may be inserted into any of the 
many available expression vectors and cell systems using reagents that are well 
known in the art. Moreover, synthetic chemistry may be used to introduce 
mutations into a sequence encoding DsrA or any fragment thereof. 

Knowledge of the nucleotide sequence as disclosed herein in SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID 
NO:ll, SEQ ID NO:13, SEQ ID NO:15, and SEQ ID NO:17, can be used to 
generate hybridization probes which specifically bind to the DNA of the present 
invention or to mRNA to determine the presence of amplification or 
overexpression of the proteins of the present invention. 

The production of cloned genes, recombinant DNA, vectors, transformed 
host cells, proteins and protein fragments by genetic engineering is well known. 
See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual (Cold Spring 
Harbor, N.Y., Cold Spring Harbor Laboratory (1989)), as well as U.S. Patent No. 
4,761,371 to Bell et al. at Col. 6 line 3 to Col. 9 line 65; U.S. Patent No. 4,877,729 
to Clark et al. at Col. 4 line 38 to Col. 7 line 6; U.S. Patent No. 4,912,038 to 
Schilling at Col. 3 line 26 to Col. 14 line 12; and U.S. Patent No. 4,879,224 to 
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Wallner at Col. 6 line 8 to Col. 8 line 59. (Applicant specifically intends that the 
disclosure of all patent references cited herein be incorporated herein in their 
entirety by reference). 

Methods for DNA sequencing which are well known and generally 
available in the art may be used to practice any of the embodiments of the 
invention. The methods may employ such enzymes as the Klenow fragment of 
DNA polymerase I } SEQUENASE® (US Biochemical Corp, Cleveland, Ohio), 
Taq polymerase (Perkin Elmer), thermostable T7 polymerase (Amersham, 
Chicago, 111.), or combinations of polymerases and proofreading exonucleases such 
as those found in the ELONGASE Amplification System marketed by Gibco/BRL 
(Gaithersburg, Md.). Preferably, the process is automated with machines such as 
the Hamilton Micro Lab 2200 (Hamilton, Reno, Nev.), Peltier Thermal Cycler 
(PTC200; MJ Research, Watertown, Mass.) and the ABI Catalyst and 373 and 377 
DNA Sequencers (Perkin Elmer). 

The nucleic acid sequences encoding DsrA may be extended utilizing a 
partial nucleotide sequence and employing various methods known in the art to 
detect upstream sequences such as promoters and regulatory elements. For 
example, one method which may be employed, "restriction-site" PCR, uses 
universal primers to retrieve unknown sequence adjacent to a known locus (Sarkar, 
G., PCR Methods Applic. 2,318-322 (1993)). In particular, genomic DNA is first 
amplified in the presence of primer to a linker sequence and a primer specific to 
the known region. The amplified sequences are then subjected to a second round of 
PCR with the same linker primer and another specific primer internal to the first 
one. Products of each round of PCR are transcribed with an appropriate RNA 
polymerase and sequenced using reverse transcriptase. 

A vector, as defined herein, is a replicable DNA construct. Vectors, such 
as plasmids, are used herein either to amplify DNA encoding the proteins of the 
present invention or to express the proteins of the present invention. An 
expression vector is a replicable DNA construct in which a DNA sequence 
encoding the proteins of the present invention is operably linked to suitable control 
sequences capable of effecting the expression of proteins of the present invention 
in a suitable host. The need for such control sequences will vary depending upon 
the host selected and the transformation method chosen. Generally, control 
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sequences include a transcriptional promoter, an optional operator sequence to 
control transcription, a sequence encoding suitable mRNA ribosomal binding sites, 
and sequences which control the termination of transcription and translation. 
Amplification vectors do not require expression control domains. All that is 
needed is the ability to replicate in a host, usually conferred by an origin of 
replication, and a selection gene to facilitate recognition of transformants. 

Vectors, as used herein, include plasmids, viruses (e.g., adenovirus, 
cytomegalovirus), phage, retroviruses and integratable DNA fragments (i.e., 
fragments integratable into the host genome by recombination). The vector 
replicates and functions independently of the host genome, or may, in some 
O instances, integrate into the genome itself Expression vectors preferably contain a 

O promoter and RNA binding sites which are operably linked to the gene to be 

% I expressed and are operable in the host organism. 

m DNA regions are operably linked or operably associated when they are 

D functionally related to each other. For example, a promoter is operably linked to a 

gj coding sequence if it controls the transcription of the sequence; a ribosome binding 

site is operably linked to a coding sequence if it is positioned so as to permit 

U 

rU translation. Generally, operably linked means contiguous and, in the case of leader 

sequences, contiguous and in reading phase. 

Transformed host cells are cells which have been transformed or 
transfected with vectors containing DNA coding for proteins of the present 
invention need not express protein. Suitable host cells include prokaryotes, yeast 
cells, or higher eukaryotic organism cells. Prokaryote host cells include gram 
negative or gram positive organisms, for example Escherichia coli (£ coli) or 
Bacilli. Higher eukaryotic cells include established cell lines of mammalian origin 
as described below. Exemplary host cells are E. coli W31 10 (ATCC 27,325), E. 
coli B, E. coli X1776 (ATCC 31,537), E. coli 294 (ATCC 31,446). A broad 
variety of suitable prokaryotic and microbial vectors are available. E. coli is 
typically transformed using a derivative of the plasmid pBR322. See Bolivar et al., 
Gene 2, 95 (1 977). Promoters most commonly used in recombinant microbial 
expression vectors include the beta-lactamase (penicillinase) and lactose promoter 
systems (Chang et al., Nature 275, 615 (1978); and Goeddel et al., Nature 281, 544 
(1979), a tryptophan (trp) promoter system (Goeddel et al., Nucleic Acids Res. 8, 
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4057 (1980) and EPO App. Publ. No. 36,776) and the tac promoter (H. De Boer et 
aL, Proc. Natl. Acad. Sci. USA 80, 21 (1983). The promoter and Shine-Dalgarno 
sequence (for prokaryotic host expression) are operably linked to the DNA of the 
present invention, i.e., they are positioned so as to promote transcription of the 
messenger RNA from the DNA. 

Expression vectors should contain a promoter which is recognized by the 
host organism. This generally means a promoter obtained from the intended host. 
While these are commonly used, other microbial promoters are suitable. Details 
concerning nucleotide sequences of many have been published, enabling a skilled 
worker to operably ligate them to DNA encoding the protein in plasmid or viral 
vectors (Siebenlistet al., Cell 20, 269 (1980). The promoter and Shine-Dalgarno 
sequence (for prokaryotic host expression) are operably linked to the DNA 
encoding the desired protein, i.e., they are positioned so as to promote transcription 
of the protein messenger RNA from the DNA. 

Eukaryotic microbes such as yeast cultures may be transformed with 
suitable protein-encoding vectors. See e.g., U.S. Patent No. 4,745,057. 
Saccharomyces cerevisiae is the most commonly used among lower eukaryotic 
host microorganisms, although a number of other strains are commonly available. 
Yeast vectors may contain an origin of replication from the 2 micron yeast plasmid 
or anautonomously replicating sequence (ARS), a promoter, DNA encoding the 
desired protein, sequences for polyadenylation and transcription termination, and a 
selection gene. An exemplary plasmid is YRp7, (Stinchcomb et al., Nature 282, 
39 (1979); Kingsman et al., Gene 7, 141 (1979); Tschemper et al., Gene 10, 157 
(1980). This plasmid contains the trpl gene, which provides a selection marker for 
a mutant strain of yeast lacking the ability to grow in tryptophan, for example 
ATCC No. 44076 or PEP4-1 (Jones, Genetics 85, 12 (1977). The presence of the 
trpl lesion in the yeast host cell genome then provides an effective environment for 
detecting transformation by growth in the absence of tryptophan. 
Suitable promoting sequences in yeast vectors include the promoters for 
metallothionein, 3-phospho-glycerate kinase (Hitzeman et al., J. Biol. Chem. 255, 
2073 (1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7, 149 
(1968); and Holland et al., Biochemistry 17, 4900 (1978), such as enolase, 
glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
-17- 



WO 01/04138 



PCT/USOO/18834 



phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, 
pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and 
glucokinase. Suitable vectors and promoters for use in yeast expression are further 
described in R. Hitzeman et al., EPO Publn. No. 73,657. 

Cultures of cells derived from multicellular organisms are a desirable host 
for recombinant protein synthesis. In principal, any higher eukaryotic cell culture 
is workable, whether from vertebrate or invertebrate culture, including insect cells. 
Propagation of such cells in cell culture has become a routine procedure. See 
Tissue Culture, Academic Press, Kruse and Patterson, editors (1973). Examples of 
useful host cell lines are VERO and HeLa cells, Chinese hamster ovary (CHO) cell 
lines, and WI138, BHK, COS-7, CV, and MDCK cell lines. Expression vectors for 
such cells ordinarily include (if necessary) an origin of replication, a promoter 
located upstream from the gene to be expressed, along with a ribosome binding 
site, RNA splice site (if intron-containing genomic DNA is used), a 
polyadenylation site, and a transcriptional termination sequence. 

The transcriptional and translational control sequences in expression 
vectors to be used in transforming vertebrate cells are often provided by viral 
sources. For example, commonly used promoters are derived from polyoma, 
Adenovirus 2, and Simian Virus 40 (SV40). See, e.g., U.S. Patent No. 4,599,308. 
The early and late promoters are useful because both are obtained easily from the 
virus as a fragment which also contains the SV40 viral origin of replication. See 
Fiers et al., Nature 273, 113 (1978). Further, the protein promoter, control and/or 
signal sequences, may also be used, provided such control sequences are 
compatible with the host cell chosen. 

An origin of replication may be provided either by construction of the 
vector to include an exogenous origin, such as may be derived from S V40 or other 
viral source (e.g. Polyoma, Adenovirus, VSV, or BPV), or may be provided by the 
host cell chromosomal replication mechanism. If the vector is integrated into the 
host cell chromosome, the latter may be sufficient. 

Host cells such as insect cells (e.g., cultured Spodoptera jrugiperda cells) 
and expression vectors such as the baculovirus expression vector (e.g., vectors 
derived from Autographa californica MNPV, Trichoplusia ni MNPV, Rachiplusia 
ou MNPV, or Galleria ou MNPV) may be employed to make proteins useful in 
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carrying out the present invention, as described in U.S. Patents Nos. 4,745,05 1 and 
4,879,236 to Smith et al. In general, a baculovirus expression vector comprises a 
baculovirus genome containing the gene to be expressed inserted into the 
polyhedrin gene at a position ranging from the polyhedrin transcriptional start 
signal to the ATG start site and under the transcriptional control of a baculovirus 
polyhedrin promoter. 

In mammalian host cells, a number of viral-based expression systems may 
be utilized. In cases where an adenovirus is used as an expression vector, 
sequences encoding DsrA may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome 
may be used to obtain a viable virus which is capable of expressing DsrA in 
infected host cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad. Sci. 81:3655- 
3659). In addition, transcription enhancers, such as the Rous sarcoma virus (RSV) 
enhancer, may be used to increase expression in mammalian host cells. 
Rather than using vectors which contain viral origins of replication, one can 
transform mammalian cells by the method of cotransformation with a selectable 
marker and the chimeric protein DNA. An example of a suitable selectable marker 
is dihydrofolate reductase (DHFR) or thymidine kinase. See U.S. Pat. No. 
4,399,216. Such markers are proteins, generally enzymes, that enable the 
identification of transformant cells, i.e., cells which are competent to take up 
exogenous DNA. Generally, identification is by survival or transformants in 
culture medium that is toxic, or from which the cells cannot obtain critical nutrition 
without having taken up the marker protein. 

In general, those skilled in the art will appreciate that minor deletions or 
substitutions may be made to the amino acid sequences of peptides of the present 
invention without unduly adversely affecting the activity thereof. Thus, peptides 
containing such deletions or substitutions are a further aspect of the present 
invention. In peptides containing substitutions or replacements of amino acids, one 
or more amino acids of a peptide sequence may be replaced by one or more other 
amino acids wherein such replacement does not affect the function of that 
sequence. Such changes can be guided by known similarities between amino acids 
in physical features such as charge density, hydrophobicity/hydrophilicity, size and 
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configuration, so that amino acids are substituted with other amino acids having 
essentially the same functional properties. For example: Ala may be replaced with 
Val or Ser; Val may be replaced with Ala, Leu, Met, or He, preferably Ala or Leu; 
Leu may be replaced with Ala, Val or He, preferably Val or lie; Gly may be 
replaced with Pro or Cys, preferably Pro; Pro may be replaced with Gly, Cys, Ser, 
or Met, preferably Gly, Cys, or Ser; Cys may be replaced with Gly, Pro, Ser, or 
Met, preferably Pro or Met; Met may be replaced with Pro or Cys, preferably Cys; 
His may be replaced with Phe or Gin, preferably Phe; Phe may be replaced with 
His, Tyr, or Trp, preferably His or Tyr; Tyr may be replaced with His, Phe or Trp, 
preferably Phe or Trp; Trp may be replaced with Phe or Tyr, preferably Tyr; Asn 
may be replaced with Gin or Ser, preferably Gin; KGln may be replaced with His, 
Lys, Glu, Asn, or Ser, preferably Asn or Ser; Ser may be replaced with Gin, Thr, 
Pro, Cys or Ala; Thr may be replaced with Gin or Ser, preferably Ser; Lys may be 
replaced with Gin or Arg; Arg may be replaced with Lys, Asp or Glu, preferably 
f«j Lys or Asp; Asp may be replaced with Lys, Arg, or Glu, preferably Arg or Glu; 

and Glu may be replaced with Arg or Asp, preferably Asp. Once made, changes 



Ln 



01 can be routinely screened to determine their effects on function with enzymes. 

As noted above, the present invention provides isolated and purified DsrA 
proteins, such as mammalian (or more preferably human) DsrA. Such proteins can 
be purified from host cells which express the same, in accordance with known 
techniques, or even manufactured synthetically. 

Nucleic acids of the present invention, constructs containing the same and 
host cells that express the encoded proteins are useful for making proteins of the 
present invention. Specific initiation signals may also be used to achieve more 
efficient translation of polynucleotide sequences encoding DsrA. Such signals 
include the ATG initiation codon and adjacent sequences. In cases where 
sequences encoding DsrA, its initiation codon, and upstream sequences are 
inserted into the appropriate expression vector, no additional transcriptional or 
translational control signals may be needed. However, in cases where only coding 
sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including the ATG initiation codon should be provided. Furthermore, the initiation 
codon should be in the correct reading frame to ensure translation of the entire 
insert. Exogenous translational elements and initiation codons may be of various 
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origins, both natural and synthetic. The efficiency of expression may be enhanced 
by the inclusion of enhancers which are appropriate for the particular cell system 
which is used, such as those described in the literature (Scharf, D. et al. Results 
Probl. Cell Differ. 20,125-162(1994)). In addition, a host cell strain may be 
chosen for its ability to modulate the expression of the inserted sequences or to 
process the expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Post-translational 
processing which cleaves a "prepro" form of the protein may also be used to 
facilitate correct insertion, folding and/or function. Different host cells which have 
specific cellular machinery and characteristic mechanisms for post-translational 
activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38), are available from the 
American Type Culture Collection (ATCC; Bethesda, Md.) and may be chosen to 
yB ensure the correct modification and processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is preferred. For example, cell lines which stably express Dsr A may be 
transformed using expression vectors which may contain viral origins of 
replication and/or endogenous expression elements and a selectable marker gene 
on the same or on a separate vector. Following the introduction of the vector, cells 
may be allowed to grow for 1 -2 days in an enriched media before they are switched 
to selective media. The purpose of the selectable marker is to confer resistance to 
selection, and its presence allows growth and recovery of cells which successfully 
express the introduced sequences. Resistant clones of stably transformed cells may 
be proliferated using tissue culture techniques appropriate to the cell type. 
Any number of selection systems may be used to recover transformed cell lines. 
These include, but are not limited to, the herpes simplex virus thymidine kinase 
(Wigler, M. et al., Cell 11, 223-32 (1977)) and adenine phosphoribosyltransferase 
(Lowy, I. et al., Cell 22, 817-23 (1980)) genes which can be employed in tk- or 
aprt- cells, respectively. Also, antimetabolite or antibiotic resistance can be used as 
the basis for selection; for example, dhfr which confers resistance to methotrexate 
(Wigler, M. et al., Proc. Natl Acad. Sci. 77, 3567-70 (1980)); npt, which confers 
resistance to the aminoglycosides neomycin and G-41 8 (Colbere-Garapin, F. et al., 
J. Mol. Biol. 150,1-14 (1981)) and als or pat, which confer resistance to 
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chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). 
Additional selectable genes have been described, for example, trpB, which allows 
cells to utilize indole in place of tryptophan, or hisD, which allows cells to utilize 
histinol in place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. 
Acad. Sci. 85:8047-51). Recently, the use of visible markers has gained popularity 
with such markers as anthocyanins, p-glucuronidase and its substrate GUS, and 
luciferase and its substrate luciferin, being widely used not only to identify 
transformants, but also to quantify the amount of transient or stable protein 
expression attributable to a specific vector system (Rhodes, C. A. et al. (1995) 
Methods Mol. Biol. 55:121-131). 

Although the presence/absence of marker gene expression suggests that the 
gene of interest (i.e., dsrA) is also present, its presence and expression may need to 
be confirmed. For example, if the sequence encoding DsrA is inserted within a 
marker gene sequence, transformed cells containing sequences encoding DsrA can 
be identified by the absence of marker gene function. Alternatively, a marker gene 
can be placed in tandem with a sequence encoding DsrA under the control of a 
single promoter. Expression of the marker gene in response to induction or 
selection usually indicates expression of the tandem gene as well. 

Alternatively, host cells which contain the nucleic acid sequence encoding 
DsrA and express DsrA may be identified by a variety of procedures known to 
those of skill in the art. These procedures include, but are not limited to, DNA- 
DNA or DNA-RNA hybridizations and protein bioassay or immunoassay 
techniques which include membrane, solution, or chip based technologies for the 
detection and/or quantification of nucleic acid or protein. 

As explained further herein, proteins of the present invention are useful as 
immunogens for making antibodies as described herein, and these antibodies and 
proteins provide a "specific binding pair." Such specific binding pairs are useful 
as components of a variety of immunoassays and purification techniques, as is 
known in the art. The proteins of the present invention are of known amino acid 
sequence as disclosed herein, and hence are useful as molecular weight markers in 
determining the molecular weights of proteins of unknown structure. 

The presence of polynucleotide sequences encoding DsrA can be detected 
by DNA-DNA or DNA-RNA hybridization or amplification using probes or 
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fragments or fragments of polynucleotides encoding DsrA. Nucleic acid 
amplification based assays involve the use of oligonucleotides or oligomers based 
on the sequences encoding DsrA to detect transformants containing DNA or RNA 
encoding DsrA. 

A variety of protocols for detecting and measuring the expression of DsrA, 
using either polyclonal or monoclonal antibodies specific for the protein are known 
in the art. Examples include enzyme-linked immunosorbent assay (ELISA), 
radioimmunoassay (RIA), and fluorescence activated cell sorting (FACS). A two- 
site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to 
two non-interfering epitopes on DsrA is preferred, but a competitive binding assay 
may be employed. These and other assays are described, among other places, in 
Hampton, R. et al. (1990; Serological Methods, a Laboratory Manual, APS Press, 
St Paul, Minn.) and Maddox, D. E. et al. (1983; J. Exp. Med. 158:121 1-1216). 
^ A wide variety of labels and conjugation techniques are known by those 

g skilled in the art and may be used in various nucleic acid and amino acid assays. 

p % Means for producing labeled hybridization or PCR probes for detecting sequences 

J related to polynucleotides encoding DsrA include oligolabeling, nick translation, 

fy end-labeling or PCR amplification using a labeled nucleotide. Alternatively, the 

sequences encoding DsrA, or any fragments thereof may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by 
addition of an appropriate RNA polymerase such as T7, T3, or SP6 and labeled 
nucleotides. These procedures may be conducted using a variety of commercially 
available kits (Pharmacia & Upjohn, (Kalamazoo, Mich.); Promega (Madison 
Wis.); and U.S. Biochemical Corp., Cleveland, Ohio)). Suitable reporter molecules 
or labels, which may be used for ease of detection, include radionuclides, enzymes, 
fluorescent, chemiluminescent, or chromogenic agents as well as substrates, 
cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding DsrA may be 
cultured under conditions suitable for the expression and recovery of the protein 
from cell culture. The protein produced by a transformed cell may be secreted or 
contained intracellularly depending on the sequence and/or the vector used. As 
will be understood by those of skill in the art, expression vectors containing 
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polynucleotides which encode DsrA may be designed to contain signal sequences 
which direct secretion of DsrA through a prokaryotic or eukaryotic cell 
membrane. Other constructions may be used to join sequences encoding DsrA to 
nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins. Such purification facilitating domains include, but 
are not limited to, metal chelating peptides such as histidine-tryptophan modules 
that allow purification on immobilized metals, protein A domains that allow 
purification on immobilized immunoglobulin, and the domain utilized in the 
FLAGS extension/affinity purification system (Immunex Corp., Seattle, Wash.), 
g The inclusion of cleavable linker sequences such as those specific for Factor XA 

or enterokinase (Invitrogen, San Diego, Calif.) between the purification domain 

Ly 

Q and DsrA may be used to facilitate purification. One such expression vector 

1 - provides for expression of a fusion protein containing DsrA and a nucleic acid 

yfl encoding 6 histidine residues preceding a thioredoxin or an enterokinase cleavage 

p site. The histidine residues facilitate purification on IMAC (immobilized metal 

y ion affinity chromatography) as described in Porath, J. et al., Prot. Exp. Purif. 3, 

fll 263-28 1 (1 992)) while the enterokinase cleavage site provides a means for 

pT| purifying DsrA from the fusion protein. A discussion of vectors which contain 

fusion proteins is provided in Kroll, D. J. et al., DNA Cell Biol. 12, 441-453 

(1993)). 

In addition to recombinant production, fragments of DsrA may be produced 
by direct peptide synthesis using solid-phase techniques (Merrifield J., J. Am. 
Chem. Soc. 85, 2149-2154 (1963)). Protein synthesis may be performed using 
manual techniques or by automation. Automated synthesis may be achieved, for 
example, using Applied Biosystems 431 A Peptide Synthesizer (Perkin Elmer). 
Various fragments of DsrA may be chemically synthesized separately and 
combined using chemical methods to produce the full length molecule. 

Antibodies that specifically bind DsrA (i.e., antibodies which bind to a 
single antigenic site or epitope on the proteins) are useful for a variety of 
diagnostic and therapeutic purposes. Antibodies to DsrA may be generated using 
methods that are well known in the art. Such antibodies may include, but are not 
limited to, polyclonal, monoclonal, chimeric, single chain, Fab fragments, and 
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fragments produced by a Fab expression library. Neutralizing antibodies, (i.e., 
those which inhibit dimer formation) are especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, 
rats, mice, humans, and others, may be immunized by injection with DsrA or any 
fragment or oligopeptide thereof which has immunogenic properties. Depending 
on the host species, various adjuvants may be used to increase immunological 
response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, 
pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, 
and dinitrophenol. Among adjuvants used in humans, BCG (bacilli Calmette- 
Guerin) and Corynebacterium parvum are especially preferable. 

Monoclonal antibodies to DsrA may be prepared using any technique 
which provides for the production of antibody molecules by continuous cell lines 
in culture. These include, but are not limited to, the hybridoma technique, the 
human B-cell hybridoma technique, and the EBV-hybridoma technique (Kohler, 
G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol Methods 
81:31-42; Cote, R. J. et al (1983) Proc. Natl Acad. Sci. 80:2026-2030; Cole, S. P. 
et al. (1984) Mol. CellBiol 62:109-120). Briefly, the procedure is as follows: an 
animal is immunized with DsrA or immunogenic fragments or conjugates thereof. 
For example, haptenic oligopeptides of DsrA can be conjugated to a carrier protein 
to be used as an immunogen. Lymphoid cells (e.g. splenic lymphocytes) are then 
obtained from the immunized animal and fused with immortalizing cells (e.g. 
myeloma or heteromyeloma) to produce hybrid cells. The hybrid cells are screened 
to identify those which produce the desired antibody. 

Human hybridomas which secrete human antibody can be produced by the 
Kohler and Milstein technique. Although human antibodies are especially 
preferred for treatment of human, in general, the generation of stable human- 
human hybridomas for long-term production of human monoclonal antibody can 
be difficult. Hybridoma production in rodents, especially mouse, is a very well 
established procedure and thus, stable murine hybridomas provide an unlimited 
source of antibody of select characteristics. As an alternative to human antibodies, 
the mouse antibodies can be converted to chimeric murine/human antibodies by 
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genetic engineering techniques. See V. T. Oi et al, Bio Techniques 4(4):2 14-221 
(1986); L. K. Sun et al, Hybridoma 5 (1986). 

The monoclonal antibodies specific for DsrA epitopes can be used to 
produce anti-idiotypic (paratope-specific) antibodies. See e.g., McNamara et al., 
Dec. 14, 1984, Science, page 1325; Kennedy, R. C. et al., (1986) Science 232:220. 
These antibodies resemble the DsrA epitope and thus can be used as an antigen to 
stimulate an immune response against H. ducreyi. 

In addition, techniques developed for the production of "chimeric 
antibodies", the splicing of mouse antibody genes to human antibody genes to 
obtain a molecule with appropriate antigen specificity and biological activity can 
be used (Morrison, S. L. et al. (1984) Proc. Natl Acad. Sci. 81, 6851-6855; 
Neuberger, M. S. et al. (1984) Nature 312:604-608; Takeda, S. et al. (1985) Nature 
314:452-454). Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce 
DsrA-specific single chain antibodies. Antibodies with related specificity, but of 
distinct idiotypic composition, may be generated by chain shuffling from random 
combinatorial immunoglobin libraries (Burton D. R. (1991) Proc. Natl. Acad. Sci. 
88,11120-3). 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening immunoglobulin libraries or panels of 
highly specific binding reagents as disclosed in the literature (Orlandi, R. et al., 
Proc. Natl. Acad. Sci. 86, 3833-3837 (1989)); Winter, G. et al., (1991) Nature 349, 
293-299(1991)). 

Antibody fragments which contain specific binding sites for DsrA may also 
be generated. For example, such fragments include, but are not limited to, the 
F(ab')2 fragments which can be produced by pepsin digestion of the antibody 
molecule and the Fab fragments which can be generated by reducing the disulfide 
bridges of the F(ab')2 fragments. Alternatively, Fab expression libraries may be 
constructed to allow rapid and easy identification of monoclonal Fab fragments 
with the desired specificity (Huse, W. D. et al. (1989) Science 254:1275-1281). 

Various immunoassays may be used for screening to identify antibodies 
having the desired specificity. Numerous protocols for competitive binding or 
immunoradiometric assays using either polyclonal or monoclonal antibodies with 
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established specificities are well known in the art. Such immunoassays typically 
involve the measurement of complex formation between DsrA and its specific 
antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering DsrA epitopes is preferred, but a 
competitive binding assay may also be employed (Maddox, supra). 

Antibodies may be conjugated to a solid support suitable for a diagnostic 
assay (e.g., beads, plates, slides or wells formed from materials such as latex or 
polystyrene) in accordance with known techniques, such as precipitation. 
Antibodies may likewise be conjugated to detectable groups such as radiolabels 
P (e.g., 35 S, ,25 I, 131 I), enzyme labels (e.g., horseradish peroxidase, alkaline 

phosphatase), and fluorescent labels (e.g., fluorescein) in accordance with known 
O techniques. 

„ ] The proteins and peptides of this invention may be used as antigens in 

Cf immunoassays for the detection of H. ducreyi in various tissues and body fluids 

q e.g., blood, spinal fluid, sputum, etc. A variety of immunoassay systems may be 

used. These include: radioimmunoassays, ELISA assays, "sandwich" assays, 

Q 

CP precipitin reactions, gel diffusion precipitin reactions, immunodiffusion assays, 

flj agglutination assays, fluorescent immunoassays, protein A immunoassays and 

immunoelectrophoresis assays. 

In addition, nucleic acids having the nucleotide sequences of the gene 
encoding DsrA or any nucleotide sequences which hybridize therewith can be 
used as probes in nucleic acid hybridization assays for the detection of H. ducreyi 
in various tissues or body fluids of patients. The probes may be used in any 
nucleic any type of hybridization assay including: Southern blots (Southern, 1975, 
J. Mol. Biol. 98:508); Northern blots (Thomas et al., 1980, Proc. Nafl Acad. Sci. 
U.S.A. 77:520 1 -05); colony blots (Grunstein et al., 1 975, Proc. Nat'l Acad. Sci. 
U.S.A. 72:3961-65), etc. Stringency of hybridization can be varied depending on 
the requirements of the assay. Assays for detecting the polynucleotides encoding 
DsrA in a cell, or the extent of amplification thereof, typically involve, first, 
contacting the cells or extracts of the cells containing nucleic acids therefrom with 
an oligonucleotide that specifically binds to DsrA polynucleotide as given herein 
(typically under conditions that permit access of the oligonucleotide to 
intracellular material), and then detecting the presence or absence of binding of 
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the oligonucleotide thereto. Again, any suitable assay format may be employed 
(see, e.g., U.S. Patent No. 4,358,535 to Falkow et al.; U.S. Patent No. 4,302,204 
to Wahl et al.; 4,994,373 to Stavrianopoulos et al; 4,486,539 to Ranki et al; 
4,563,419 to Ranki et al.; and 4,868,104 to Kurn et al.) (the disclosures of which 
applicant specifically intends be incorporated herein by reference). 

Kits for determining if a sample contains proteins of the present invention 
will include at least one reagent specific for detecting the presence or absence of 
the protein. Diagnostic kits for carrying out antibody assays may be produced in a 
number of ways. In one embodiment, the diagnostic kit comprises (a) an antibody 
P which binds proteins of the present invention conjugated to a solid support and (b) 

[Tj a second antibody which binds proteins of the present invention conjugated to a 

; detectable group. The reagents may also include ancillary agents such as buffering 

y§ 

ry agents and protein stabilizing agents, e.g., polysaccharides and the like. The 

J3 

diagnostic kit may further include, where necessary, other members of the signal- 
producing system of which system the detectable group is a member (e.g., enzyme 
p substrates), agents for reducing background interference in a test, control reagents, 

g apparatus for conducting a test, and the like. A second embodiment of a test kit 

nJ comprises (a) an antibody as above, and (b) a specific binding partner for the 

antibody conjugated to a detectable group. Ancillary agents as described above 
may likewise be included. The test kit may be packaged in any suitable manner, 
typically with all elements in a single container along with a sheet of printed 
instructions for carrying out the test. 

Antisense oligonucleotides and nucleic acids that express the same may be 
made in accordance with conventional techniques. See, e.g., U.S. Patent No. 
5,023,243 to Tullis; U.S. Patent No. 5,149,797 to Pederson et al. The length of 
the antisense oligonucleotide (i.e., the number of nucleotides therein) is not 
critical so long as it binds selectively to the intended location, and can be 
determined in accordance with routine procedures. In general, the antisense 
oligonucleotide will be from 8, 10 or 12 nucleotides in length up to 20, 30, or 50 
nucleotides in length. Such antisense oligonucleotides may be oligonucleotides 
wherein at least one, or all, or the intemucleotide bridging phosphate residues are 
modified phosphates, such as methyl phosphonates, methyl phosphonothioates, 
phosphoromorpholidates, phosphoropiperazidates and phosphoramidates. For 
-28- 



WO 01/04138 



PCT/US00/18834 



example, every other one of the internucleotide bridging phosphate residues may 
be modified as described. In another non-limiting example, such antisense 
oligonucleotides are oligonucleotides wherein at least one, or all, of the 
nucleotides contain a 2' loweralkyl moiety (e.g., Ci-C 4 , linear or branched, 
saturated or unsaturated alkyl, such as methyl, ethyl, ethenyl, propyl, 1-propenyl, 
2-propenyl, and isopropyl). For example, every other one of the nucleotides may 
be modified as described. See also P. Furdon et al., Nucleic Acids Res. 17, 9193- 
9204 (1989); S. Agrawal et al., Proc. Natl. Acad Sci. USA 87, 1401-1405 (1990); 
C. Baker et al., Nucleic Acids Res. 18, 3537-3543 (1990); B. Sproat et al., Nucleic 
Nj Acids Res. 17, 3373-3386 (1989); R. Walder and J. Walder, Proc. Natl. Acad Sci. 

1 USA 85, 501 1-5015 (1988). 

Lii ' 

S hi another embodiment of the invention, DsrA, its catalytic or 

fj immunogenic fragments or oligopeptides thereof, can be used for screening 

03 libraries of compounds in any of a variety of drug screening techniques. The 

g fragment employed in such screening may be free in solution, affixed to a solid 

support, borne on a cell surface, or located intracellularly. The formation of 
51 binding complexes, between DsrA and the agent being tested, may be measured. 

S] Another technique for drug screening which may be used provides for high 

throughput screening of compounds having suitable binding affinity to the protein 
of interest as described in published PCT application WO84/03564. In this 
method, as applied to DsrA, large numbers of different small test compounds are 
synthesized on a solid substrate, such as plastic pins or some other surface. The 
test compounds are reacted with DsrA, or fragments thereof, and washed. Bound 
DsrA is then detected by methods well known in the art. Purified DsrA can also 
be coated directly onto plates for use in the aforementioned drug screening 
techniques. Alternatively, non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in 
which neutralizing antibodies capable of binding DsrA specifically compete with a 
test compound for binding DsrA. In this manner, the antibodies can be used to 
detect the presence of any peptide which shares one or more antigenic 
determinants with DsrA. 
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The proteins, peptides, polynucleotides and vectors comprising the 
polynucleotides of the present invention may be used as immunogens in vaccines 
against H. ducreyi, which vaccines are an aspect of the present invention. When 
used as an immunogen, it is not necessary to use the entire DsrA protein, although 
the entire DsrA protein may be used. Polypeptides, fragments, and/or antigenic 
determinants of DsrA may also be used as immunogens in the practice of the 
invention. The vaccines are used to prevent or reduce susceptibility to H. ducreyi 
infection. 

u The vaccines comprise an immunologically effective amount of the 

-0 immunogen in a pharmaceutically acceptable carrier. The combined immunogen . 

o 

y and carrier may be an aqueous solution, emulsion, or suspension. An 

J|: immunologically effective amount is deteraiinable by means known in the art 

ty without undue experimentation, given the teachings contained herein. 

Pharmaceutically acceptable carriers are known to those skilled in the art and 
~"Z include stabilizers, diluents, and buffers. Suitable stabilizers include carbohydrates, 

O such as sorbitol, lactose, mannitol, starch, sucrose, dextran, and glucose and 

q proteins, such as albumin or casein. Suitable diluents include saline, Hanks 

ry Balanced Salts, and Ringers solution. Suitable buffers include an alkali metal 

phosphate, an alkali metal carbonate, or an alkaline earth metal carbonate. 

The immunogens of the invention are immunogenic without adjuvant, 
however adjuvants may increase immunoprotective antibody titers or cell mediated 
immunity response. Such adjuvants could include, but are not limited to, Freund's 
complete adjuvant, Freund's incomplete adjuvant, aluminum hydroxide, aluminum 
phosphate, aluminum oxide or a composition that consists of a mineral oil, such as 
Marcol 52, or a vegetable oil and one or more emulsifying agents, 
dimethyldioctadecyl-ammonixam bromide, Adjuvax (Alpha-Beta Technology), 
Inject Alum (Pierce), Monophosphoryl Lipid A (Ribi Immunochem Research), 
MPL+ TDM (Ribi Immunochem Research), Titermax (CytRx), toxins, toxoids, 
glycoproteins, lipids, glycolipids, bacterial cell walls, subunits (bacterial or viral), 
carbohydrate moieties (mono-, di-, tri- tetra-, oligo- and polysaccharide) various 
liposome formulations or saponins. Other adjuvants that may be included in 
vaccine compositions of the present invention include, but are not limited to: 
surface active substances (e.g., hexadecylamine, octadecylamine, octadecyl amino 
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□ 



acid esters, lysolecithin, dimethyl-dioctadecylammonium bromide), 
methoxyhexadecylgylcerol, pluronic polyols; polyamines (e.g., pyran, 
dextransulfate, poly IC, carbopol); and peptides (e.g., muramyl dipeptide, 
dimetbylglycine, tuftsin). The immunogen may also be incorporated into 
liposomes, or conjugated to polysaccharides and/or other polymers for use in a 
vaccine formulation. Combinations of various adjuvants may be used with the 
conjugate to prepare the immunogen formulation. Exact formulation of the 
vaccine compositions will depend on the particular conjugate, the species to be 
immunized and the route of administration. 
=J The vaccines of the invention are prepared by techniques known to those 

O skilled in the art, given the teachings contained herein. Generally, the irnmunogens 

yj 

p are mixed with the carrier to form a solution, suspension, or emulsion. One or more 

S | of the additives discussed above may be in the carrier or may be added 

J3 subsequently. The vaccine preparations may be dessicated, for example, by freeze 

p drying for storage purposes. If so, they may be subsequently reconstituted into 

liquid vaccines by the addition of an appropriate liquid carrier. 

Any suitable vaccine and method of vaccination (i.e., immunization) 
known in the art may be employed in carrying out the present invention, as long as 
an active immune response against the antigen is elicited. When administered 
according to the present invention, the vaccine induces an active and protective 
immune response against unmodified cancer cells. Exemplary vaccination 
methods include, but are not limited to, "naked DNA" vaccines, viral and bacterial 
vector vaccines, liposome associated antigen vaccines, and peptide vaccines. 
Vaccines may be live vaccines, attenuated vaccines, killed vaccines, or subunit 
vaccines. Methods of vaccinating animals and humans against irnmunogens are 
well-known in the art. See, e.g., S. Crowe et disinfections of the Immune System, 
in Basic and Clinical Immunology, 697-715 (D. P. Stites & A. L Terr, eds., 7th ed. 
1991). 

The vaccines of the present invention are administered to humans or other 
mammals, including bovine, ovine, caprine, equine, leporine, porcine, canine, 
feline and avian species, with humans being particularly preferred. The vaccines 
may administered to human children, including children younger than 18 months 
of age. Preferably, the vaccines of the present invention are administered to those 
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subjects that are at particular risk of developing H. ducreyi infection (;.£?., subjects 
living in geographic locations where H. ducreyi is common). 

The vaccines may be administered in one or more doses. The vaccines may 
be administered by known routes of administration for this type of vaccine, 
including parenteral administration, such as subcutaneous, intramuscular, or 
intravenous administration. Oral administration may also be used, including oral 
dosage forms which are enteric coated. 

The schedule of administration of the vaccine may vary depending on the 
strain of H. ducreyi being used, the age and/or condition of the subject to be 
immunized, the particular formulation of the vaccine, and other factors known to 
those in the art. Subjects may receive a single dose, or may receive a booster dose 
or doses. Annual boosters may be used for continued protection. 

The immunogens of this invention can be formulated as univalent and 
multivalent vaccines. The immunogens (i.e., the protein DsrA) can be mixed, 
conjugated or fused with other antigens, including B or T cell epitopes of other 
antigens. In addition to its utility as a primary immunogen, DsrA can be used as a 
carrier protein to confer or enhance immunogenicity of other antigens. 

When a haptenic peptide of DsrA is used, (i.e., a peptide which reacts with 
cognate antibodies, but cannot itself elicit an immune response), it can be 
conjugated to an immunogenic carrier molecule. For example, an oligopeptide 
containing one or more epitopes of DsrA may be haptenic. Conjugation to an 
immunogenic carrier can render the oligopeptide immunogenic. Preferred carrier 
proteins for the haptenic peptides of DsrA are tetanus toxin or toxoid, diphtheria 
toxin or toxoid and any mutant forms of these proteins such as CRM197. Others 
include exotoxin A of Pseudomonas, heat labile toxin of E. coli and rotaviral 
particles (including rotavirus and VP6 particles). Alternatively, a fragment or 
epitope of the carrier protein or other immunogenic protein can be used. For 
example, the hapten can be coupled to a T cell epitope of a bacterial toxin. 

The peptides or proteins of this invention can be administered as 
multivalent subunit vaccines in combination with other antigens of H. ducreyi. For 
example, they may be administered in conjunction with oligo- or polysaccharide 
capsular components of H. ducreyi such as polyribosylribitolphosphate (PRP). 
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Peptides and proteins having epitopes of DsrA evoke bactericidal 
antibodies which may act synergistically in killing H. ducreyi with antibodies 
against other outer membrane proteins of H. ducreyi. Thus, in an embodiment of 
the invention, DsrA (or a peptide or protein having a common epitope) is 
administered in conjunction with other outer membrane proteins oiH. ducreyi (or 
peptides or proteins having epitopes thereof) to achieve a synergistic bactericidal 
activity. For combined administration with epitopes of other outer membrane 
proteins, the DsrA peptide can be administered separately, as a mixture or as a 
conjugate or genetic fusion peptide or protein. The conjugates can be formed by 
standard techniques for coupling proteinaceous materials. Fusions can be 



;=J expressed from fused gene constructs prepared by recombinant DNA techniques as 

□ described. 

I The immunogens of this invention can be administered as live vaccines. To 

fl this end, recombinant microorganisms are prepared that express the peptides or 

p proteins. The vaccine recipient is inoculated with the recombinant microorganism 

1{ which multiplies in the recipient, expresses the DsrA peptide or protein and evokes 

yj a immune response to H. ducreyi. Preferred live vaccine vectors are pox viruses 

gj such as vaccinia (Paoletti and Panicali, U.S. Pat. No. 4,603,1 12) and attenuated 

Salmonella strains (Stocker, U.S. Pat. No. 4,550,081). 

Live vaccines are particularly advantageous because they lead to a 



prolonged stimulus which can confer substantially long-lasting immunity. When 
the immune response is protective against subsequent H. ducreyi infection, the live 
vaccine itself may be used in a preventative vaccine against H. ducreyi. 

Multivalent live vaccines can be prepared from a single or a few 
recombinant microorganisms that express different epitopes of H. ducreyi. In 
addition, epitopes of other pathogenic microorganisms can be incorporated into the 
vaccine. For example, a vaccinia virus can be engineered to contain coding 
sequences for other epitopes in addition to those of H. ducreyi. Such a recombinant 
virus itself can be used as the immunogen in a mulivalent vaccine. Alternatively, a 
mixture of vaccinia or other viruses, each expressing a different gene encoding for 
different epitopes of outer membrane proteins of H. influenza and/or epitopes of 
other disease causing organisms can be formulated as a multivalent vaccine. 
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An inactivated virus or bacterial vaccine may be prepared. Inactivated 
vaccines are "dead" in the sense that their infectivity has been destroyed, usually 
by chemical treatment (e.g., formaldehyde treatment). Ideally, the infectivity of the 
virus or bacteria is destroyed without affecting the proteins which carry the 
immunogenicity of the vector. In order to prepare inactivated vaccines, large 
quanitites of the recombinant vector expressing the desired epitopes are grown in 
culture to provide the necessary quantity of relevant antigens. A mixture of 
inactivated viruses or bacteria expressing different epitopes may be used for the 
formulation of "multivalent" vaccines. In certain instances, these "multivalent" 

q inactivated vaccines may be preferable to live vaccine formulation because of 

P otent i a I difficulties arising from mutual interference of live viruses administered 

Q together. In either case, the inactivated virus or mixture of viruses should be 

in 

formulated in a suitable adjuvant in order to enhance the immunological response 
to the antigens. Suitable adjuvants include: surface active substances, e.g., 
□ hexadecylamine, octadecyl amino acid esters, octadecylamine, lysolecithin, 

dimethyl-dioctadecylammonium bromide, N, N-dicoctadecyl-N'-N'bis (2- 
hydroxyethyl-propane diamine), methoxyhexadecylglycerol, and pluronic polyols; 
polyamines, e.g., pyran, dextransulfate, poly IC, carbopol; peptides, e.g., muramyl 
dipeptide, dimethylglycine, tuftsin; oil emulsions; and mineral gels, e.g., aluminum 
hydroxide, aluminum phopshate, etc. 

One particularly preferred embodiment of the invention is an attenuated 
vaccine comprising an H. ducreyi strain that does not express DsrA. The H. 
ducreyi strains that do not express DsrA used in these vaccines may be naturally 
occurring strains, or may be recombinant and/or isogenic mutants ofH. ducreyi 
strains that do express the protein. Of these attenuated vaccines, a vaccine 
comprising the H. ducreyi mutant strain FX517 described herein is most preferred. 

The bactericidal antibodies induced by DsrA epitopes can be used to 
passively immunize an individual against H. ducreyi. Passive immunization 
confers short-term protection for a recipient by the administration of the pre- 
formed antibody. Passive immunization can be used on an emergency basis for 
special risks, e.g., young children exposed to contact with subjects already afflicted 
with H. ducreyi infection (chancroid). 
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In view of the foregoing description, the invention also comprises a method 
for inducing an immune response to H. ducreyi in a mammal in order to protect the 
mammal against infection by invasive or non-invasive H. ducreyi. The method 
comprises administering an immunologically effective amount of the immunogens 
of the invention to the host and, preferably, administering the vaccines of the 
invention to the host. 

The following Examples are provided to illustrate the present invention, and 
should not be construed as limiting thereof. Unless otherwise noted, all chemicals 
and reagents were from Sigma Chemicals (St. Louis, MO). Standard recombinant 
g DNA methods were used as described in Sambrook et al. (supra) or following 

manufacturers instructions. 

EXAMPLE 1 

Materials and Methods: Bacterial Strains and Media 

Bacterial strains used in the experiments described herein are shown below 

O in Table 1. For routine growth, H. ducreyi was maintained on chocolate agar 

Lp 

P plates obtained from UNC Hosptial Clinical Microbiology Lab. This medium was 

prepared using Mueller Hinton base and contained no fetal calf serum. When 5% 

lu fetal calf serum was required for optimal growth ( H. ducreyi strains CHIA and 

1 157), Gonococcal medium base (GCB) used for preparation and instructions were 
followed (Difco). Antibiotics were used at the following concentrations for E. 
colt ampicillin, 100 jag/ml; chloramphenicol, 30 ug/ml; kanamycin, 30 u-g/ml; 
ug/ml; streptomycin, 100 jig/ml. YoxH. ducreyi, antibiotics were 
chloramphenicol, 1 ug/ml or streptomycin, 100 ug/ml. 



UJ 
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Table 1. Bacterial strains and plasmids 

Strain/PIasmid Relevant Genotype/Phenotvp e Source/Reference/Isolatec 

£co//K-12 



recA, gyrB 



Bethesda Research Labs 



H. ducreyi 
35000 



wild type 



Stanley Spinola 
Indiana Univ. 



35000 Co-integrate 
beta galactosidase positive 
intermediate in FX5 17 
construction, Cm' 

35000 dsrA, Cm r 



□ 
III 



CIP542 (Canada) 

CIPA77 

CIP 542 (CDC) 



William Albritton 
Robert Munson 
Stephen Morse 
Centers for Disease Control 



H. ducreyi obtained from Pat Totten 

CIP A75 

CHIA 

HD167 

V-1157 

V-1168 

M90-02 

406 

425 

54 

010-2 
HD301 
HD350 



(10) 

Pasteur Institute 

VDRL 

VDRL 

Seattle 

Seattle 

Bahamas 

Mississippi 

Mississippi 

Mississippi 

Dominican Republic 

Thailand 

Kenya 
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Table 1. Bacterial strains and plasmids continued 

Strain/Plasmid Relevant Ge notype/Phenotype Source/Reference/Isolateri 



Plasmids 
pCRII 

pUNCH 1248 
pLS88 

pUNCH 1254 
pUNCH 1255 

pRSM1791 
pUNCH 1256 

pUNCH 1260 
pNC40 



PCR cloning vector Invitrogen 
Kan r , Amp r 

dsrA PCR clone using This work 

primers 14 and 16 
in pCRII vector 

Shuttle plasmid (9) 
Kan r , Str*, Suf 

dsrA subclone. £CoRl fragment This work 
of pUNCfi 1248 in . 
£coRl of pLS88 

mutagenized dsrA; This work 

pUNCH 1254 
mutagenizedwith 
CAT cassette from pNC40 
Kan r , Cm r This work 

Mutagenesis plasmid (6) 
Beta gal + , Amp r 

pUNCH 1255 This work 

(Smal/HinCII/Klenow) into the 
NotI (Klenow) of pRSM 1 79 1 

dsrA PCR clone using This work 

primers 14 and 16 in pLSKS 

source of CAT cassette, Amp r ,Cm r (37) 



EXAMPLE 2 

Outer membrane isolation, analysis, SDS-PAGE and immunoblotting 

Large scale cultures of H. ducreyi were performed in Fernbach flasks with 
1 liter of GCB-I broth containing 5% fetal calf serum and 50 ug/ml heme (Elkins, 
C. Identification and purification of a conserved heme-regulated hemoglobin- 
binding outer membrane protein from Haemophilus ducreyi. Infec Immun. 63, 
1241-1245 (1995)). Cultures of £ coli were performed in LB broth or LB agar 
plates containing appropriate antibiotics. Outer membranes were harvested as 
previously described Id. Protein concentrations were determined using the BCA 
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kit from Pierce (Rockford, IL). SDS-PAGE and Western blotting were performed 
as previously described (11). The lipooligosaccharide (LOS) of H. ducreyi was 
prepared using the method of Hitchcock and Brown (Hitchcock, P.G., and Brown, 
T.M., Morphological heterogeneity among Salmonella LPS chemotypes in silver- 
stained polyacrylamide gels. J. Bacteriol. 154, 269-277 (1983). LOS was analyzed 
by SDS-PAGE and silver staining (Tsai, CM. and Frasch, C.E., A sensitive silver 
stain for detecting lipopolysaccharides in polyacrylamide gels. Anal. Biochem. 
155, 1 15-1 19 (1982)) or Western blotting with Mab 3F1 1 (Apicella, M.A. et al., 
Phenotypic variation in epitope expression of the Neisseria gonorrhoeae 
lipooligosaccharide. Infect Immun. 55:1755-1761 (1987). 

EXAMPLE 3 
N-terminal sequence amino aeid (aa) determination 

The N-terminal aa sequence of DsrA was determined from strain 35000. 
Outer membranes were subjected to preparative SDS-PAGE and Western transfer 
to PVDF. The blot was stained temporarily with Ponceau S protein stain to locate 
the DsrA protein, which in strain 35000 migrates just below the 30 kDa standard 
protein. Strips of the blot were probed with anti-OpaF (generously provided by 
Janice Babcock and Richard Rest of Hahnemann Medical College) of gonococcal 
strain FA1090 and Mab 5C9. Anti-OpaF, for unknown reasons, cross-reacts with 
DsrA and Mab 5C9 reacts with a previously described K ducreyi lipoprotein 
(termed Hip) of similar molecular weight (1 8). These antibodies were used in 
order to unequivocally identify the proper band to sequence. The corresponding 
30kDa-OpaF reactive band from the remainder of the Ponceau S stained blot was 
sequenced. The sequence obtained from the 30 kDa band was QQPPKFAGVS 
SLYSYEYDYG KGKKTKSNEG. This sequence did not match the processed 
mature, N-terminal sequence of Opa or Hip 28 kDa (Hip would be expected not to 
sequence, since it is a lipoprotein). We concluded that these three proteins were 
distinct. 

The antiserum to DsrA was produced as follows. Outer membranes from 
H. ducreyi strain 35000 were electophoresed on large preparative well 12% SDS- 
PAGE gels. The gel was briefly stained and the corresponding 30 kDa band 
excised and electroemted using a Centrilutor (Amicon) following the manufactuers 
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instructions. Mice were immunized a total of 3 times with 25 ug of gel purified 
protein per immunization. Freunds complete adjuvant was used for the first 
immunization and incomplete for the remainder. 



yi 



EXAMPLE 4 
Vector-Anchored PCR 

Two degenerate oligonucleotides deduced from the N-terminal amino acid 
sequence (#6 and #7, Fig. 2) specifically hybridized to a 1 . 1 kb EcoM genomic 
fragment (data not shown). Attempts to clone this fragment using size selected 
DNA using several plasmid vectors were unsuccessful. Therefore a series of three 
separate vector-anchored PCR strategies were utilized to clone the dsrA structural 
gene, upstream flanking DNA and downstream flanking DNA, respectively. The 
first vector-anchored PCR (Fig. 2, V-A PCR 1) used the ligation between the 1.1 
kb EcoRA size-selected DNA and vector pBluescript as template and used 5' 

0 primer #6 and vector primer KS as amplimers. An approximate 700 bp fragment 
was amplified and preliminary sequence obtained. The N-tenriinal sequence 

01 originally obtained from Edman degradation matched the deduced amino acid 

fy sequence of the PCR product, but was not homologous to known sequences in the 

data bases. In contrast, the C-terminus of the gene was homologous to UspA2 and 
YadA (see results below), this suggested the possibility of PCR generated 
artifact(s). To rule out PCR artifact additional PCR was performed. The primers 
used included 5' primers #6, 8 and 9 and 3' primers 1 1 and 12. The latter 4 
primers were derived from the DNA sequence obtained from the original anchored 
PCR product above (Fig. 2 and data not shown). Identically sized products from 
total H. ducreyi chromosomal DNA template (and the original anchored PCR 
product, the + control template were amplified) using 3' primers from the region 
with homology to C-terminal YadA (primers #1 1 and #12) (data not shown). 
Furthermore, Southern hybridization of H. ducreyi chromosomal DNA probed with 
oligos #6, #7, #8, #9, #11, #12 and the PCR product generated from #8 and #12 all 
specifically recognized the 1.1 kb band ECoRl band (Fig. 1 and data not shown). 
It was concluded that the N-terminal aa sequence obtained from the 30 kDa protein 
is found on the same ORF that has C-terminal homology to UspA2/YadA. These 
data established that the open reading frame (ORF) data were correct. 
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To obtain sequence upstream of the structural gene for dsrA, a second 
vector-anchored PCR was used (Fig. 2, V-A PCR 2). Again, the template was the 
ligation between the 1 .1 kb EcoRl size-selected DNA and vector pBluescript but 
the primers used were #12 and vector primer KS. A (1069) bp fragment which 
included the upstream EcoR.1 site (Fig 2.) was amplified. 

To obtain sequence downstream of the dsrA gene a third vector-anchored 
PCR was used (Fig. 2, V-A PCR 3). Southern hybridization identified an 
approximate 4 kb Bgl II fragment which hybridized with dsrA probes and there are 
no BglR sites in the 1.1 kb Ec6R\ fragment. Fragments of 3-5 kb Bgl II restricted 
chromosomal DNA were isolated and ligated to BarriRl, shrimp alkaline 
phosphatase treated pMCL2 1 0 vector. The ligaton reaction was ethanol 
precipitated and amplified using primers 10 and vector primer T7 (promoter), 
yielding an approximately 2.5 kb PCR product. The products of all three vector- 
anchored PCR reactions were sequenced with appropriate primers to obtain 
preliminary sequence and these sequences confirmed one another (data not shown). 

Commercially available PCR tubes (Ready to Go, Pharmacia) were utilized 
for PCR. Analytical PCR (25 ul final volume) utilized single tubes whereas 
preparative PCR combined the "beads" from 4 tubes into single tube (100 ul final 
volume). The MgCk concentration in all PCR reactions was 4 mM. The first two 
vector anchored PCRs used 5 ul of ligation and 25 pm of each primer. The 
conditions for PCR for first two vector anchored PCRs were: hot start 5 min 94C; 
denature 94C; 1 minute annealing, 50C, 1 minute; extension 72, 1 minute. The 
conditions for the third PCR were identical except that the extension time was 3 
min. 

EXAMPLE 5 
DNA sequencing and analysis. 

DNA sequence analysis was performed at the University of North Carolina 
at Chapel Hill Automated Sequencing Facility utilizing Taq terminator chemistry. 
The final sequences presented for strain 35000 in Fig. 2 and for the other H. 
ducreyi strains in Fig. 9 was obtained from PCR products using primers #14 and 
24 which flank the dsrA gene (Fig.l). Both strands of the were completely 
sequenced. The sequence data were assembled using the program AssemblyLIGN 
(IBI). The preliminary sequence for the dsrA structural gene from 35000 obtained 
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by vector-anchored PCR was in complete agreement with the final sequence 
presented (Fig.3). Amino acid alignments were done by Clustal in the program 
GeneJockeyll (Cambridge, UK) and PAM 250 setting. Bestfit (GCG Computer 
Group, Wisconsin) was used to generate similarity and identity scores using a gap 
weight of 8. 

EXAMPLE 6 
Plasmid Constructions 

Plasmid pUNCH 1248 was constructed by PCR. A 900 bp fragment was 
jf amplified from H. ducreyi strain 35000 using primers 14 and 1 6 (FIG. 2) using 

p the conditions described above for the first two vector anchored PCRs. The 

g product was ligated to pCRII following manufacturers directions, tranfbrmed into 

m E. coli DH5 and recombinants identified by restriction analysis. E. coli 

Hi 

g harboring pUNCH 1 248 grew poorly, was propagated only on agar plates to reduce 

[L, to Possibilty of mutation/deletion, and gave rise to an occasional larger colony. 

f£j Subclone 1 254 was constructed by isolating the EcdRl fragment of pUNCH 1248 

S 311(1 Ugation into £coRl restricted pLS88. dsrA of pUNCH 1254 was mutagenized 

by insertion of a CAT (Chloramphenicol Acetyl Transferase) into the open reading 
frame to construct pUNCH 1255. To perform this, a CAT cassette (a BgUl 
fragment from pNC40 was treated with Klenow to fill-in the ends) was ligated into 
the Ndel site of pUNCH 1 254 (previously treated with Klenow to produce blunt 
ends). pUNCH 1256 was constructed by moving the insert from pUNCH 1255 
(containing mutagenized dsrA) into plasmid pRSM1791 for subsequent 
mutagenesis. This was done by isolation of a Smal to HinCU fragment of pUNCH 
1255, Klenow treatment and ligation into the Notl site of pRSMl 791 previously 
treated with Klenow. Transformation of E. coli host was performed and selection 
using Amp and Cm yielded pUNCH 1256. 

EXAMPLE 7 

Construction and characterization of an H. ducreyi dsrA mutant 
An isogenic mutant (FX5 1 7, Table 1) was constructed by allelic 
replacement of the wild-type locus of strain 35000 with the mutation in pUNCH 
1256 using a previous system of mutagenesis described by Bozue et al (Bozue, 
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J. A. et al.; Facile construction of mutations in Haemophilus ducreyi using lacZ as 
a counter-selectable marker. FEMS Microbiology Letters. 164, 269-73 (1998)). In 
this procedure, a mutagenized copy of the locus (containing a chloramphenicol 
(Cm or CAT) cassette) was subcloned into a plasmid able to express lacZ (pUNCH 
1256). H. ducreyi were electroporated and Cm r transformants selected (Elkins et 
al., Characterization of the hgbA locus of Haemophilus ducreyi. Infect Immun. 63, 
2194-2200 (1995); Hansen, EJ. et al., Use of electroporation to construct isogenic 
mutants of Haemophilus ducreyi. J. Bacteriol. 174, 5442-9 (1992)). These 
transformants putatively contained the entire plasmid integrated due to a single 
crossover event (as exemplified by FX516, Table 1). Individual transformants 
were streaked onto Cm medium containing X-gal. Since the product of X-gal is 
highly toxic to H. ducreyi the co-integrates grow as tiny blue colonies. The loss of 
the X-gal sequences and neighboring wild type allele via a resolution of the co- 
integrate results in only the mutant allele being retained (exemplified by FX 517, 
Table 1). These H. ducreyi mutants grew as normal-sized white colonies on the 
medium containing Cm and X-gal similar to other H. ducreyi mutants containing 
CAT cassettes (Elkins, C. et al., Characterization of the hgba locus of 
Haemophilus ducreyi. Infect Immun. 63, 2194-2200 (1995); Elkins, C. et al., Role 
of the Haemophilus ducreyi Ton system in internalization of heme from 
hemoglobin. Infection & Immunity 66,151-60 (1998); Thomas, C. et al., Cloning 
and characterization of tdhA, a locus encoding a TonB-dependent Heme receptor 
from Haemophilus ducreyi. Infect Immun. 66, 1-9 (1998)) and data not shown. 

Southern blot and PCR analysis was used to confirm that an allelic 
replacement occurred in the generation of H. ducreyi mutant FX517. 
Chromosomal DNA was isolated from strains 35000, FX516, and FX517, digested 
with Hindi and subjected to electrophoresis and bidirectional transfer. The two 
blots were probed with either the PCR product of oligos 14 and 16 or the Bgl II 
CAT fragment from pUNCH 40. Digoxigenin-labeled, bound probe was detected 
with alkaline phosphatase labeled anti-digoxigenin antibody (Boehringer 
Mannheim) followed by detection with nitroblue tetrazolium and 5-bromo-4- 
chloro-3-indolyl phosphate (NBT/BCIP). PCR confirmation of the mutant utilized 
primers 14 and 16 which flank the Ndel site (CAT cassette) used for gene 
disruption. 
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EXAMPLE 8 

Complementation of FX517 and other dsrA mutants in trans 
To rule out that the serum susceptibility of dsrA mutant FX517 was due to 
a mutation elsewhere on the chromosome or polar downstream effects, 
complementation in trans was performed. Briefly, we PCR amplified the dsrA and 
surrounding locus using primers 14 and 24 (Fig. 2), Klenow treated the PCR 
product, and restricted the PCR product with HinDlll (which restricts just 
downstream of dsrA, Fig.2). After gel purification, the PCR product was ligated 
into SmaVHinDlll restricted pLSKS (Wood, G.E. et al., Target and cell range of 
the Haemophilus ducreyi hemolysin and its involvement in invasion of human 
ipithelial cells. Infect andlmmun. In Press.) The ligation was ethanol precipitated 
and H. ducreyi strain FX517 electroporated. Streptomycin resistant colonies were 
screened for production of DsrA by Western blotting and confirmed by restriction 
analysis. One experimental transformant, pUNCH \260dsrA, and one vector 
transformant were selected for further study. pUNCH 1260 and the vector pLSKS 
(negative control) were then electroporated into the three naturally occurring dsrA 
mutants (CIP A75, CIP A77, CIP 542 (Can), Table 1). 

EXAMPLE 9 
Serum susceptibility 

The resistance of H.ducreyi to normal human serum was performed as 
previously described (Odumeru; Carbonetti) with the following modifications: An 
18-24 hour culture of H. ducreyi from chocolate agar plates was scraped into GCB 
broth to an OD600 of 0.2. A 10" 4 to 10* 5 dilution was made (approximately 1000 
CFU/ml, depending on the strain) and aliquots mixed with pooled fresh normal 
human serum (NHS) or heat inactivated NHS (56C, 30 min) to a final 
concentration of 25 or 50% NHS. After incubation for 45 minutes at 35C in 5% 
C02, 100 ul aliquotes were plated onto chocolate agar plates and viable counts 
performed after 48 hours. Data are expressed as percent survival in the fresh NHS 
as compared to survival in heat-inactivated NHS (number of CFU survivors in 
fHNS/number of survivors in heated NHS X 100). Strains containing pUNCH 
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1260 orpLSKS were propagated and plated on chocolate agar containing 
streptomycin at 100 ug/ml. 

EXAMPLE 10 

Identification of a 30 kDa protein involved in serum resistance. 

During the course of studies characterizing the H. ducreyi interaction with 
PMNs, a series of Western blots were performed using various antibodies to the 
Opa proteins from gonococci. It was found that a polyclonal antiserum to OpaF of 
gonococcal strain FA1090 reacted at a dilution of 1 :5000 with a protein (DsrA) 
h that varied between 28 and 35 kDa in a panel of strains (data not shown). One 

g strain, CIPA75, did not react. CIPA75 was of interest because it had previously 

been shown to be avirulent in the chilled rabbit model of infection, to be serum 
j|] susceptible, to exhibit reduced adherence to HEp-2 cells and to have a truncated 

LOS (Odumeru, J.A. et al, Role of lipopoly saccharide and complement in 

o susceptibility of Haemophilus ducreyi to human serum. Infectt Immun. 50,495-9 

in 

p (1985); Rice, PA., Molecular basis for serum resistance in Neisseria gonorrhoeae. 

Clinical Microbiology Reviews. 2, SI 12-7 (1989). Specific antisera to DsrA was 
fy generated using DsrA purified by preparative SDS-PAGE and electroeiution of 

outer membranes from H. ducreyi strain 35000. Western blots of several 
geographically diverse lab and clinical isolates were probed with anti-DsrA (Fig. 
1). This was done to confirm that the previous cross-reactivity seen with the anti- 
OpaF serum was due to the presence of DsrA and to ascertain what percentage of 
strains expressed dsrA. The proteins recognized in the DsrA Western blot (Fig. 1) 
and the OpaF Western blot (data not shown) appeared to be identical. Most strains 
in Fig. 1, expressed an immunoreactive protein, except for the previously reported 
avirulent strains CIP A75, CIP A77 (25-27) and CIP542 (Can., obtained from 
Canada) (Alfa, M.J. et al., Use of tissue culture and animal models to identify 
virulence-associated traits of Haemophilus ducreyi. Infection & Immunity 
63:1754-61 (1995)). In contrast, virulent CIP 542 (CDC), obtained from the CDC 
and previously shown to cause a laboratory acquired infection (Trees, D.L. et al., 
Laboratory-acquired infection with Haemophilus ducreyi type strain CIP 542. Med 
Microbiol. 330-337 (1992)), expressed dsrA. Previous studies documented that 
virulent H. ducreyi strains are serum resistant. We performed serum susceptiblity 
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studies of selected H. ducreyi strains which did and did not express dsrA and these 
results are summarized at the bottom of Fig 1. For the purposes of this study, we 
arbitrarily termed a strain serum resistant if there were more than 10% survivors 
when exposed to 50% fNHS serum as compared to NHS. The specific percent 
survivors (+/- sd) for each of the strains tested in Figure 1 are: 35000, 79%; CIP 
A75, ;CIPA77, ; CIP 542 (Can); CIP 542 (CDC); CfflA, ;V-1157, ; M90- 
02; and 406, .Thus, in these initial studies there was a correlation between strains 
tested which expressed detectable dsrA and serum resistance. This correlation 
between the lack expression of dsrA and serum susceptibility in the dsrA mutant 

O strains, some of which also had LOS alteratons [Odumeru, 1985 #576], could 

O 

y merely be coincidental. Therefore additional molecular studies were performed 

culminating in the generation of an isogenic dsrA mutant for biological studies. 

nj 

'f' EXAMPLE 11 

H Molecular Studies 

til 

p Through a series of experiments involving Western blotting, 

01 

P immunoprecipitation and finally N-terminal amino acid sequencing, it was 

rw determined that the DsrA protein was not the same as the previously described 28 

kDa lipoprotein termed Hip (17) (data not shown). The N-terminal amino acid 
sequence of the immunoreactive DsrA 30 kDa protein of strain 35000 was found to 
be: QQPPKFAGVS SLYSYEYDYG KGKKTKSNEG. No known homologies 
were initially detected when this peptide sequence was searched against 
GENBANK, including gonococcal Opa proteins. 

Two degenerate oligonucleotides (#6 and #7) were synthesized based on 
the above N-terminal sequence and found to hybridize specifically to a 1.1 kB 
EcoRl chromosomal band from K ducreyi strain 35000 (data not shown). 
Attempts to clone this fragment were unsuccessful and three separate vector- 
anchored PCR reactions (V-A PCR) were used to amplify the relevant locus and 
surrounding regions (Fig. 2). Preliminary sequencing of the product of V-A PCR 1 
(Fig. 2) identified an ORF that was homologous to the UspA2 protein of 
Moraxella catarrhalis and the YadA protein of Yersinia spp., but only in the C- 
terminal region. Since both of these proteins are implicated in determining 
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important virulence traits (including serum resistance), additional studies were 
undertaken. 

EXAMPLE 12 

DNA and deduced amino acid sequence of the H. ducreyi dsr A locus from 
strain 35000 

The DNA sequence of the dsr A locus, including 100 bp of sequence 
upstream of the ATG start and 126 bp of sequence downstream of the TAA 
tennination codon are presented in Fig. 3. The data presented were obtained from 
p PCR products amplified using primers 14 and 24. Sequences similar to -35 

H (TGATAA) and - 1 0 (TATATT) E coli promoter consensus sequences were found 

O beginning at nt 13 (TTGACA) and nt 35 (TAGAAT) respectively, and were 

m separated by 1 6 nt. A putative ribosome-binding site (TAATGAGG) was found 

beginning 13 nt upstream of the dsr A start codon. Beginning at nt 913 and ending 
O at nt 946 was an inverted hairpin loop containing 13 matched nucleotides, 

p consistent with a transcription terminator. The gene immediately downstream of 

% dsr A and in the opposite orientation was an ORF with homology to the 
W hypothetical protein HI0107 of the genome sequence ofH. ducreyi. The GC 

content of the 1 kb of DNA sequence presented was 34.5%, consistent with the 
AT-rich nature of Haemophilus spp. DNA. 

The dsr A ORF predicted a protein of 28215 daltons, which when processed 
would yield a mature protein of 26375 daltons. This is in agreement with 
migration in SDS-PAGE for strain 35000 (FIG. 1). Comparison of the deduced 
amino acid sequence of DsrA with the N-terminal amino acid sequence revealed 
identity in 28 of 30 amino acids. The first two residues of the mature protein, QQ, 
were unusual in their charges; however, certain versions of mature YadA begin 
with two charged amino acids (se^- below) [Skurnik, 1989 #1051; Rosqvist, 1988 
#1052]. Just preceding the DsrA _:Q residues was the unusual signal peptidase I 
cleave site of TMA. Consistent with the outer membrane localization, DsrA 
contained a carboxyl terminal motif ending with a phenylalanine which is found in 
the majority of integral outer membrane proteins (Struyve, M. et al., Carboxyl- 
terminal phenylalanine is essential for the correct assembly of a bacterial outer 
membrane protein. J. Mol Biol. 218, 141-148 (1991)). The mature DrsA protein 
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was predicted to be very basic with a pi of 9. 1 and which accounts for its poor 
transfer during Western blotting (data not shown). 

Alignment of the DsrA protein with similar proteins is shown in FIG. 4. 
DsrA was most similar to UspA2 and YadA in a region of the C-terminus and was 
most divergent in the N-terminus. Using the Bestfit program, DsrA was 45% 
similar and 40% identical to UspA2; DsrA was 47% similar and 39% identical to 
YadA. It should be noted that both of these heterologous proteins are considerable 
larger than DsrA which may account for such differences in the N-terminal 
domains. The C-terminus of YadA is believed to be anchored in the outer 
membrane and the N-terminus encodes the functional regions of the YadA protein 
(Rogenkamp, A. et al., Substitution of two histidine residues in YadA protein of 
Yersinia enterocolitica abrogates collagen binding, cell adherence and mouse 
virulence. Molecular Microbiology 16, 1207-19 (1995); Roggenkamp, A. et al., 
Deletion of amino acids 29 to 81 in adhesion protein YadA of Yersinia 
enterocolitica serotype 0:8 results in selective abrogation of adherence to 
neutrophils. Infection & Immunity 65, 2506-14 (1996); Tamm, A., et al., 
Hydrophobic domains affect the collagen-binding specificity and surface 
polymerization as well as the virulence potential of the YadA protein of Yersinia 
enterocolitica. Molecular Microbiology. 10,995-1011 (1993)). 



EXAMPLE 13 

Construction and characterization of an H. ducreyi dsrA mutant. 
An isogenic mutant (FX517, Table 1) was constructed by allelic replacement of the 
wild-type locus of strain 35000. Initial attempts to obtain a double crossover with 
a CAT cassette in the cloned gene were unsuccessful using pUNCH 1255 (data not 
shown). Therefore, we used a recently described method to obtain mutants 
(Bozue, J.A. et al., Facile construction of mutations in Haemophilus ducreyi using 
lacZas a counter-selectable marker. FEMS Microbiology Letters. 164:269-73 
(1998)).Using this procedure, several chloramphenicol resistant cointegrates were 
obtained. After streaking each cointegrate onto X-gal chocolate plates, several 
mutants were obtained for each cointegrate and none of mutants expressed dsrA 
(data not shown). One mutant, FX5 17 was selected for further study. Outer 
membranes were made from the parent and mutant strain FX517 and subjected to 
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SDS-PAGE and Coomassie staining or SDS-PAGE and Western blotting (Figures 
5A and 5B, respectively). DsrA is an abundant outer membrane protein in strain 
35000 but is absent in the mutant. No reactivity was obtained from FX514 using 
anti-DsrA antisera (FIG. 5, Panel B) or anti-OpaF (data not shown). Similar to 
UspA2 and YadA, DsrA had a propensity to form multimers, especially when 
solubilized at the lower temperature of 37C (FIG. 5, Panel A, and data not shown). 

The structure of the mutagenized dsrA locus in FX517 was confirmed using 
Southern blotting and PCR. In Southern blots of chromosomal DNA from the 
parent and mutant strains the HinCll band recognized by the dsrA probe increased 
in size approximately 1 kb in the mutant as compared to the parent band. 
Similarly, an identical blot hybridized with the CAT probe recognized only the 
larger Hindi band of the mutant (data not shown). PCR of the 35000 and FX5 17 
dsrA locus with primers flanking the CAT insertion indicated the locus was 
approximately 1 kb larger in the mutant (data not shown). These data are 
consistent with an allelic replacement event. 

EXAMPLE 14 

fu Serum resistance phenotype of the dsrA mutant 

The serum susceptibility of the naturally occurring dsrA mutants and the 
role of the related YadA and UspA2 proteins in mediating serum resistance 
prompted us to test FX517 for serum sensitivity. Serum killing studies of parent 
strain 35000 and dsrA mutant FX517 were performed using 25 and 50% normal 
pooled serum (Fig. 6). FX517 was very susceptible to NHS and demonstrated zero 
or 2% survival in 50% and 25% NHS, respectively. In contrast, parent strain 
35000 was relatively serum resistant, exhibiting 79% and 50% survival in 50% and 
25% NHS (p values 0.002 and 0.004 for 50% and 25% NHS, respectively, using 
Students paired T test). Thus DsrA appeared to be required for expression of a 
serum resistant phenotype. 

EXAMPLE 15 
Complementation of dsrA mutants 
It was possible that a cryptic mutation had occurred during the construction 
of FX517 which could account for its serum suceptibility phenotype. Furthermore, 
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we wished to determine whether the serum susceptiblity of the three naturally 
occurring dsrA mutants could be converted to serum resistance if they expressed 
dsrA. Each dsrA mutant (isogenic mutant FX517 or naturally occurring mutants 
CIP A75, CIP A77, and CIP 542 (Can)) was electroporated with pUNCH 1 260 
(dsrA) or pLSKS (vector control) plasmids. These shuttle plasmids are able to 
replicate in H. ducreyi Strains containing pUNCH 1260, but not pLSKS, 
expressed dsrA (Fig. 7A). Subjectively, it appeared that more DsrA was expressed 
from the strains complemented with the dsrA plasmid than from 35000 (n=4), 
perhaps due to gene dosage or growth on medium containing streptomycin. 
Expression of dsrA from plasmid pUNCH 1260 suggested that the tentatively 
identified promoter (Fig. 3), was driving expression of the cloned dsrA gene since 
very little additional upstream DNA was present and the insert was in the opposite 
direction of the lac promoter in pLSKS. 

Bactericidal killing was performed on each of the complemented dsrA 
mutants (Fig. 7B). For strains FX517, CIP A75, CIP A77, and CIP 542 (Can), 
expression of dsrA from pUNCH 1260 conferred serum resistance. However, for 
strains harboring the plasmid vector lacking an insert serum resistance was not 
conferred. 



EXAMPLE 16 
Lipooligosaccharide expression by H. ducreyi 
In some bacterial systems, mutants in LOS are more serum susceptible. 
Indeed, it was reported by Odumeru that the reason for the serum susceptibility of 
H. ducreyi strains CIP A75 and CIP A77 was due to LOS truncation. It was 
possible that the lack of dsrA expression in dsrA mutants (FX517, CIP A75, CIP 
A77 and CIP 542 (CAN)) resulted in the truncation of LOS directly or indirectly. 
Alternatively repair of dsrA expression in LOS/dsrA apparent double mutants (CIP 
A75 and CIP A77) might affect LOS expression and subsequent serum 
susceptiblity. To address these possibilities, LOS was analyzed by SDS-PAGE 
and silver staining (Fig 8) and Western blotting (data not shown). We compared 
35000 and FX517 LOS (without plasmids) in several silver stained gels and the 
migration patterns were always indistinguishable. Furthermore, Western blotting 
of 35000 and FX517 LOS with anti-LOS Mab 3F1 1 was similar. 
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Silver stained LOS gels of the complemented dsrA mutants were 
indistinguishable between each strain pair containing either pUNCH 1260 (dsrA) 
or pLSKS, respectively. There was a minor variation in a faster migrating LOS 
band for some of the strains (CIP542, no plasmid present) when grown on 
antibiotic free chocolate (Mueller Hinton base) as compared to the same strain 
(CIP542, either plasmid present) grown on streptomycin chocolate (Gonococcal 
medium base). However, it should be noted that within each pair of matched 
strains (expressing or not expressing dsrA), there were no apparent major LOS 
differences. Thus, under the limited conditions examined here, the presence of 
DsrA and not LOS length was the dominant determinant of serum resistance. 

EXAMPLE 17 
Structural Analysis of dsrA in other H. ducreyi strains 
Western blotting of a variety H. ducreyi strains (Fig. 1) suggested strongly 
that DsrA varied in molecular weight and/or amino acid sequence among the 
strains. Furthermore, we desired to understand whether mutations had occurred in 
the naturally occurring dsrA mutants or whether the possibility of phase variation 
could account for their inability to express dsrA. PCR was used to amplify a 1 .2 
kb fragment from 8 additional strains, including the dsrA mutants (Fig. 2, primers 
14 and 24). The deduced amino acid sequence indicated that overall the DsrA 
protein was quite similar between strains (Fig. 9). Two regions with modest 
variability were observed and termed variable region 1 and 2 (VR1 and VR2). 
Variable region 1 included amino acids roughly 90-100 (depending on the strain) 
and a few substitutions and insertions were noted. Variable region 2 contained 
either 1, 2, or 3 identical copies of the heptamer repeat sequence NTHNINK and 
spanned amino acids 174-195 in the various strains. It is likely that the different 
number of repeat sequences was the predominant factor accounting for the variable 
migration seen in SDS-PAGE and Western blotting. Excepting for mutant strain 
CIP542(Can), which contained a stop codon (see below), the sequences for all 
other 8 DsrA proteins were identical after VR2. Thus, DsrA is highly conserved in 
sequence, despite its variable mobility in gels. 
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EXAMPLE 18 
Affinity purification of DsrA using vitronectin (Vn) 

H. ducreyi were surface iodinated using Iodogen-coated tubes as directed 
by the manufacturer. Briefly, to a tube coated with 50 ug of iodogen was added 
0.5 mCi of Nal (Amersham IMS30) and 0.5 ml of 1 X 109.H. ducreyi in 
Phosphate buffered saline (PBS). After incubation for 2 minutes, the labeled 
bacteria were centrifuged and washed in medium to remove unincorporated 
iodine. The procedure labels primarily tyrosine residues on surfaced exposed 
proteins (outer membrane proteins). Each indicated biotinylated Vn was mixed 
P with an aliquot of strain 35000 and strain FX5 1 7 whole surface-iodinated H. 

!=j ducreyi. After Vn binding to H. ducreyi strains (15 min), unbound Vn was 

removed by centrifugation and washing. Bacteria and Vn were solubilized in the 
m detergent ZW3,14 and insoluble material removed by centrifugation for 5 min at 

15,000 x g- The detergent soluble proteins were mixed with strepavidin-agarose 
P solid phase. After incubation (2 hours), extensive washing the strepavidin-agarose 

rj with ZW3,14 in PBS, the samples were boiled in Laemmli sample buffer, and 

% subjected to SDS-PAGE and autoradiography. The results are shown in FIG. 12, 

fU . showing that affinity purification of DsrA from whole cells is possible using 

biotinylated vitronectins (Vn). 



EXAMPLE 19 
Method of attachment of H. ducreyi to Human Cells. 

Efficient attachment of H. ducreyi to a keratinocyte cell line requires DsrA 
expression. H. ducreyi were added to the HaCaT cells at a MOI of 10:1 and 
incubated for 4 hours. After removal of unbound bacteria by extensive washing, 
CFUs were determined by plating the disrupted monolayer. Results are shown in 
FIG. 11, illustrating showing that efficient attachment ofH. ducreyi to a 
keratinocyte cell line requires DsrA expression. Data are from 4 experiments. 

The foregoing is illustrative of the present invention and is not to be 
construed as limiting thereof. The invention is defined by the following claims, 
with equivalents of the claims to be included therein. 
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SEQUENCE LISTING 

SEQ ID NO: 1 DNA SEQUENCE OF 35000 (set forth in FIG. 3) 

ATAAATACGTCATTGACATTTTT TTAATGTAAG GTAGAATAAG AAAGTAAATT 
CTATATTTAC AATCAAGATT GACAATTATT TACTTAATGA GGTGATTATG 
AAAATTAAAT GTTTAGTTGC CGTAGTGGGA TTAGCTTGTT CTACTATTAC 
AACAATGGCT CAGCAGCCGC CAAAGTTTGC TGGAGTATCT TCTTTGTATA 
GCTATGAGTATGACTATGGT AAGGGTAAAT GGACTTGGTC TAATGAAGGC 
u GGTTTCGATA TTAAAGTGCC AGGGATTAAA ATGAAGCCAA AAGAATGGAT 

o TTCTAAACAG GCTACTTATC TTGAATTACA GCATTATATG CCTTATACTC 

O CTGTTCTCGT GACATATGCT CCTGGCGTTT CTCCTAGCCC TATACTGTTA 

y 

p TATCCGATGT CTGATCCTGA TCAACTTGGA ATAAATCGGC AGCAGCTGAA 

Ul ATTGAATTTG TATAGTTATT TTAACGATTT AAGACACGAT TTTAAATTAA 

AAGTTCTTGA TGCACGTATT TCCAAAAATA AACAAAATAT TGATACTATA 

r AGTAAATATT TACTAGAACT GGGTACTTAT TTAGATGATT CTTATCGTAT 

GATGGAACAA AATACACATA ATATCAATAA GTTGTCTAAA GAATTGCAAA 
CTGGTTTAGC CAACCAATCA GCATTGTCTA TGTTAGTGCA ACCAAATGGT 

m GTAGGCAAAA CGAGCGTTTC TGCTGCGGTA GGAGGTTATA GAGATAAAAC 

TGCATTAGCC ATTGGTGTCG GCTCACGCAT TACTGATCGC TTTACCGCTA 
AAGCGGGTGT AGCGTTCAAT ACCTACAATG GCGGCATGTC TTATGGTGCT 
TCTGTTGGTT ATGAATTCTA ATCATTACGT TTAATCACTA ATCGTTTTGG 
TTATAATAAA AAGGCTAAAT GTTTCTCCTC ACATTTAGCC TTTCTTATTT 
ATCTTTGTTA TAGCTTTTGC TGTTATAAAA CCGTTTTTTA GCCACTTTTA 
TTAATTAAGC TTTTAAGCCT ATTCAATCAG TTCTACTTTC ACTTTTTTCA 
CCATATTATC CGCCACTTCT AAAACGGTAA TATTAAGTTG GTTTAGCCTA 
AATTGGGTAC CTTCTATCGG AATTTTTTCT AAATGTTCTA AAATTAAGCC 
GTTAAAGGTG CGGAC 

SEQ ID NO: 2 PROTEIN SEQUENCE OF 35000 (set forth in FIG. 3) 

MKIKCLVAVV GLACSTITTM AQQPPKFAGV SSLYSYEYDY GKGKWTWSNE 
GGFDIKVPGI KMKPKEWISK QATYLELQHY MPYTPVLVTY APGVSPSPIL 
LYPMSDPDQL GINRQQLKLN LYSYFNDLRH DFKLKVLDAR 1SKNKQNIDT 
ISKYLLELGT YLDDSYRMME QNTHNINKLS KELQTGLANQ SALSMLVQPN 
GVGKTSVSAA VGGYRDKTAL AIGVGSRITD RFTAKAGVAF NTYNGGMSYG 
ASVGYEF 
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SEQ ID NO:3 DNA SEQUENCE OF CIPA75 

ATTTTATAATTTACAATACATTTTATATTTTTATATTATATAAATACGTCATTGACATTT 

TTTTAAGGTAGAATAAGAAAGTAAATTCTATATTTACAATCAAGATTGACAATTATTTA 

CTTAATGAGGTGATTATGAAAATTAAATGTTTAGTTGCCGTAGTGGGATTAGCTTGTTC 

TACTATTACAACAATGGCTCAGCAGCCGCCAAAGTTTGCTGGAGTATCTTCTTTGTATA 

GCTATGAGTATGACTATGGTAAGGGTAAATGGACTTGGTCTAATGAAGGCGGTTTCGA 

TATTAAAGTGCCAGGGATTAAAATGAAGCCAAAAGAATGGATTTCTAAACAGGCTACT 

TATCTTGAATTACAGCATTATATGCCTTATACTCCTGTTCTCGTGACATATGCTCATGAC 

GTTCCTCCTAGCTCTATACTGTTATATCCGATGTCTGATCCTGATCAACTTGGAATAAA 

TCGGCAGCAGCTGAAATTGAATTTGTATAGTTATTTTAACGATTTAAGACACGATTTTA 

AATTAAAAGTTCTTGATGCACGTATTTCCAAAAATAAACAAAATATTGATACTATAAG 

TAAATATTTACTAGAACTGGGTACTTATTTAGATGATTCTTATCGTATGATGGAACAAA 

ATACACATAATATCAATAAAAATACACATAATATCAATAAGTTGTCTAAAGAATTGCA 

AACTGGTTTAGCCAACCAATCAGCATTGTCTATGTTAGTGCAACCAAATGGTGTAGGC 

AAAACGAGCGTTTCTGCTGCGGTAGGAGGTTATAGAGATAAAACTGCATTAGCCATTG 

GTGTCGGCTCACGCATTACTGATCGCTTTACCGCTAAAGCGGGTGTAGCGTTCAATACC 

TACAATGGCGGCATGTCTTATGGTGCTTCTGTTGGTTATGAATTCTAATCATTACGTTTA 

ATCACTAATCGTTTTGGTTATAATAAAAAGGCTAAATGTTTCTCCTCACATTTAGCCTTT 

CTTATTTATCTTTGTTATAGCTTTTGCTGTTATAAAACCGTTTTTTAGCCACTT^ 

TTAAGCTTTTAAGCCTATTCAATCAGTTCTACTTTCACTTTTTTCACCATATTATCCGCC 

ACTTCTAAAACGGTAATATTAAGTTGGTTTAGCCTAAATTGGGTACCTTCTATCGGAAT 

TTTTTCTAAATGTTCTAAAATTAA 

SEQ ID NO:4 PROTEIN SEQUENCE OF CIPA75 (set forth in FIG. 9} 

MKIKCLVAVV GLACSTITTM AQQPPKFAGV SSLYSYEYDY GKGKWTWSNE 
GGFDIKVPGI KMKPKEWISK QATYLELQHY MPYTPVLVTY AHDVPPSSIL 
LYPMSDPDQL G1NRQQLKLN LYSYFNDLRH DFKLKVLDAR ISKNKQNIDT 
ISKYLLELGT YLDDSYRMME QNTHNINKNT HNINKLSKEL QTGLANQSAL 
SMLVQPNGVG KTSVSAAVGG YRDKTALAIG VGSRITDRFT AKAGVAFNTY 
NGGMSYGASV GYEF 

SEQ ID NO: 5 DNA SEQUENCE OF CIPA77 

ATTTTATAATTTACAATACATTTTATATTTTTATATTATATAAATACGTC 

TTTTAAGGTAGAATAAGAAAGTAAATTCTATATTTACAATCAAGATTGACAATTATTTA 

CTTAATGAGGTGATTATGAAAATTAAATGTTTAGTTGCCGTAGTGGGATTAGCTTGTTC 

TACTATTACAACAATGGCTCAGCAGCCGCCAAAGTTTGCTGGAGTATCTTCTTTGTATA 

GCTATGAGTATGACTATGGTAAGGGTAAATGGACTTGGTCTAATGAAGGCGGTTTCGA 

TATTAAAGTGCCAGGGATTAAAATGAAGCCAAAAGAATGGATTTCTAAACAGGCTACT 
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TATCTTGAATTACAGCATTATATGCCTTATACTCCTGTTCTCGTGACATATGCTCATGAC 

GTTCCTCCTAGCTCTATACTGTTATATCCGATGTCTGATCCTGATCAACTTGGAATAAA 

TCGGCAGCAGCTGAAATTGAATTTGTATAGTTATTTTAACGATTTAAGACACGATTTTA 

AATTAAAAGTTCTTGATGCACGTATTTCCAAAAATAAACAAAATATTGATACTATAAG 

TAAATATTTACTAGAACTGGGTACTTATTTAGATGATTCTTATCGTATGATGGAACAAA 

ATACACATAATATCAATAAAAATACACATAATATCAATAAGTTGTCTAAAGAATTGCA 

AACTGGTTTAGCCAACCAATCAGCATTGTCTATGTTAGTGCAACCAAATGGTGTAGGC 

AAAACGAGCGTTTCTGCTGCGGTAGGAGGTTATAGAGATAAAACTGCATTAGCCATTG 

GTGTCGGCTCACGCATTACTGATCGCTTTACCGCTAAAGCGGGTGTAGCGTTCAATACC 

TACAATGGCGGCATGTCTTATGGTGCTTCTGTTGGTTATGAATTCTAATCATTACGTTTA 

ATCACTAATCG 

SEQ ID NO:6 PROTEIN SEQUENCE OF CIPA77 (set forth in FIG. 9) 

MKIKCLVA W GLACSTITTM AQQPPKFAGV SSLYSYEYDY GKGKWTWSNE 
GGFDIKVPGI KMKPKEWISK QATYLELQHY MPYTPVLVTY AHDVPPSSIL 
LYPMSDPDQL GINRQQLKLN LYSYFNDLRH DFKLKVLDAR ISKNKQNIDT 
ISKYLLELGT YLDDSYRMME QNTHNINKNT HNINKLSKEL QTGLANQSAL 
SMLVQPNGVG KTSVSAAVGG YRDKTALAIG VGSRITDRFT AKAGVAFNTY 
NGGMSYGASV GYEF 

SEQ ID NO: 7 DNA SEQUENCE OF CIP542 (Can) 

TTTTATAATTTACAATACATTTTATATTTTT^ 

TTAATGTAAGGTAGAATAAGAAAGTAAATTCTATATTTACAATCAAGATTGACAATTA 
TTTACTTAATGAGGTGATTATGAAAATTAAATGTTTAGTTGCCGTAGTGGGATTAGCTT 

TATAGCTATGAGTATGACTATGGTAAGGGTAAATGGACTTGGTCTAATGAAGGCGGTT 

TCGATATTAAAGTGCCAGGGATTAAAATGAAGCCAAAAGAATGGATTTCTAAACAGGC 

TACTTATCTTGAATTACAGCATTATATGCCTTATACTCCTGTTCTCGTGACATATGCTCC 

TGGCGTTTCTCCTAGCCCTATACTGTTATATCCGATGTCTGATCCTGATCAACTTGGAAT 

AAATCGGCAGCAGCTGAAATTGAATTTGTATAGTTATTTTAACGATTTAAGACACGATT 

TTAAATTAAAAGTTCTTGATGCACGTATTTCCAAAAATAAACAAAATATTGATACTATA 

AGTAAATATTTACTAGAACTGGGTACTTATTTAGATGATTCTTATCGTATGATGGAACA 

AAATACACATAATATCAATAAGTTGTCTAAAGAATTGCAAACTGGTTTAGCCAACCAA 

TCAGCATTGTCTATGTTAGTGCAACCAAATGGTGTAGGCAAAACGAGCGTTTCTGCTGC 

GGTAGGAGGTTATAGAGATAAAACTGCATTAGCCATTGGTGTCGGCTCACGCATTACT 

GATCGCTTTACCGCTAAAGCGGGTGTAGCGTTCAATACCTTCTATCGGAATTTTTTCTA 

AATGTTCTAAAATTA 
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SEQ ID NO: 8 PROTEIN SEQUENCE OF CIP542 (Can) (set forth in FIG. 9) 

MKIKCLVA W GLACSTITTM AQQPPKFAGV SSLYSYEYDY GKGKWTWSNE 
GGFDIKVPGI KMKPKEWISK QATYLELQHY MPYTPVLVTY APGVSPSPIL 
LYPMSDPDQL GINRQQLKLN LYSYFNDLRH DFKLKVLDAR ISKNKQNIDT 
ISKYLLELGT YLDDSYRMME QNTHNINKLS KELQTGLANQ SALSMLVQPN 
GVGKTSVSAA VGGYRDKTAL AIGVGSRITD RFTAKAGVAF NT 

SEQ ID NO: 9 DNA SEQUENCE OF CIP542 (CDC) 

AATGGCCATTTTATAATTTACAATACATTTTATATTTTTATATTATATAAA 

GACATTTTTTTAATGTAAGGTAGAATAAGAAAGTAAATTCTATATTTACAATCAAGATT 

GACAATTATTTACTTAATGAGGTGATTATGAAAATTAAATGTTTAGTTGCCGTAGTGGG 

ATTAGCTTGTTCTACTATTACAACAATGGCTCAGCAGCCGCCAAAGTTTGCTGGAGTAT 

CTTCTTTGTATAGCTATGAGTATGACTATGGTAAGGGTAAATGGACTTGGTCTAATGAA 

GGCGGTTTCGATATTAAAGTGCCAGGGATTAAAATGAAGCCAAAAGAATGGATTTCTA 

AACAGGCTACTTATCTTGAATTACAGCATTATATGCCTTATACTCCTGTTCTCGTGACA 

TATGCTCCTGGCGTTTCTCCTAGCCCTATACTGTTATATCCGATGTCTGATCCTGATCAA 

CTTGGAATAAATCGGCAGCAGCTGAAATTGAATTTGTATAGTTATTTTAACGATTTAAG 

ACACGATTTTAAATTAAAAGTTCTTGATGCACGTATTTCCAAAAATAAACAAAATATTG 

ATACTATAAGTAAATATTTACTAGAACTGGGTACTTATTTAGATGATTCTTATCGTATG 

ATGGAACAAAATACACATAATATCAATAAGTTGTCTAAAGAATTGCAAACTGGTTTAG 

CCAACCAATCAGCATTGTCTATGTTAGTGCAACCAAATGGTGTAGGCAAAACGAGCGT 

TTCTGCTGCGGTAGGAGGTTATAGAGATAAAACTGCATTAGCCATTGGTGTCGGCTCA 

CGCATTACTGATCGCTTTACCGCTAAAGCGGGTGTAGCGTTCAATACCTACAATGGCG 

GCATGTCTTATGGTGCTTCTGTTGGTTATGAATTCTAATCATTACGTTTAATCACTAATC 

GTTTTGGTTATAATAAAAAGGCTAAATGTTTCTCCTCACATTTAGCCTTTCTTATTTA 

TTTGTTATAGCCTTTTGCTGTTATAAAACCGTTTTTTAGCCACTTTTAT^ 

TAAGCCTATTCAATCAGTTCTACTTTCACTTTTTTCACCATATTATCCGCCACTTCTAAA 

ACGGTAATATTAAGTTGGTTTAGCCTAAATTGGGTACCTTCTATCGGAATTTTTTCTAA 

ATGTTCTAAAA TTA A 

SEQ ID NO:10 PROTEIN SEQUENCE OF CIP542 (CDC) (set forth in FIG. 
9) 

MKIKCLVAVV GLACSTITTM AQQPPKFAGV SSLYSYEYDY GKGKWTWSNE 
GGFDIKVPGI KMKPKEWISK QATYLELQHY MPYTPVLVTY APGVSPSPIL 
LYPMSDPDQL GINRQQLKLN LYSYFNDLRH DFKLKVLDAR ISKNKQNIDT 
ISKYLLELGT YLDDSYRMME QNTHNINKLS KELQTGLANQ SALSMLVQPN 
GVGKTSVSAA VGGYRDKTAL AIGVGSRITD RFTAKAGVAF NTYNGGMSYG 
ASVGYEF 
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SEQ ID NO: 11 DNA SEQUENCE OF CfflA 

TATTTACAATCAAGATTGACAATTATTTACTTAATGAGGTGATTATGAAAATTAAATGT 

TTAGTTGCCGTAGTGGGATTAGCTTGTTCTACTATTACAACAATGGCTCAGCAGCCGCC 

AAAGTTTGCTGGAGTATCTTCTTTGGATAGCTATGAGTATGACTATGGTAAGGGTAAAT 

GGACTTGGTCTGAAAAAGACGGTTTCGATATTAAAGCGCCAGGGATTAAAATGAAGeC 

AAAAAAATGGATTTCTAGACAGGCTACTTATCTTGGATTACAGCATTATATGCCTTATA 

CTCCTGTTCTCGTGACATATGCTTCTGCAGAACCTAACACTGTACTGTTATATCCGATG 

CCTGATCCTGATCAACTTGGAATAAATCGGCAGCAGCTGAAATTGAATTTGTATAGTTA 

TTTTAACGATTTAAGACACGGTTTTAAATTAAATGTTCTTGATGCACGTATTTCCCAAA 

ATAAACAAAATATTGATACTATAAGTGAATATTTACTAAAACTGGGTACTTATTTAGAT 

AGTTCTTATCGTATGATGGAACAAAATACACATAATATCAATAAAAATACACATAATA 

TCAATAAGTTGTCTAAAGAATTGCAAACTGGTTTAGCCAACCAATCAGCATTGTCTATG 

TTAGTGCAACCAAATGGTGTAGGCAAAACGAGCGTTTCTGCTGCGGTAGGAGGTTATA 

GAGATAAAACTGCATTAGCCATTGGTGTCGGCTCACGCATTACTGATCGCTTTACCGCT 

AAAGCGGGTGTAGCGTTCAATACCTACAATGGCGGCATGTCTTATGGTGCTTCTGTTGG 

TTATGAATTCTAATCATTACGTTTAATCACTAATCGTTTTGGTTATAATAAAAAGGCTA 

AATGTTTCTCCTCACATTTAGCCTTTCTTATTTATCTTTGT 

SEQ ID NO: 12 PROTEIN SEQUENCE OF CHIA(set forth in FIG. 9) 

MKIKCLVAVV GLACSTITTM AQQPPKFAGV SSLDSYEYDY GKGKWTWSEK 
DGFDIKAPGI KMKPKKWISR QATYLGLQHY MPYTPVLVTY ASAEPNTVLL 
YPMPDPDQLG INRQQLKLNL YSYFNDLRHG FKLNVLDARI SQNKQNIDTI 
SEYLLKLGTY LDSSYRMMEQ NTHNINKNTH NINKLSKELQ TGLANQSALS 
MLVQPNGVGK TSVSAAVGGY RDKTALAIGV GSPJTDRFTA KAGVAFNTYN 
GGMSYGASVG YEF 

SEQ ID NO: 13 DNA SEQUENCE OF V-1157 

CTTTTATAATTTACAATACATTTTATATTTTTATATTATATAAATACGTCATTGACATTT 

TTTTAATGTAAGGTAGAATAAGAAAGTAAATTCTATATTTACAATCAAGATTGACAATT 

ATTTACTTAATGAGGTGATTATGAAAATTAAATGTTTAGTTGCCGTAGTGGGATTAGCT 

TGTTCTACTATTACAACAATGGCTCAGCAGCCGCCAAAGTTTGCTGGAGTATCTTCTTT 

GTATAGCTATGAGTATGACTATGGTAAGGGTAAATGGACTTGGTCTAATGAAGGCGGT 

TTCGATATTAAAGTGCCAGGGATTAAAATGAAGCCAAAAGAATGGATTTCTAAACAGG 

CTACTTATCTTGAATTACAGCATTATATGCCTTATACTCCTGTTCTCGTGACATCTGCTC 

CTGACGTTCCTCCTAGCTCTATACTGTTATATCCGATGTCTGATCCTGATCAACTTGGA 

ATAAATCGGCAGCAGCTGAAATTGAATTTGTATAGTTATTTTAACGATTTAAGACACG 

ATTTTAAATTAAAAGTTCTTGATGCACGTATTTCCAAAAATAAACAAAATATTGATACT 
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ATAAGTAAATATTTACTAGAACTGGGTACTTATTTAGATGGTTCTTATCGTATGATGGA 
ACAAAATACACATAATATCAATAAAAATACACATAATATCAATAAAAATACACATAAT 
ATCAATAAGTTGTCTAAAGAATTGCAAACTGGTTTAGCCAACCAATCAGCATTGTCTAT 
GTTAGTGCAACCAAATGGTGTAGGCAAAACGAGCGTTTCTGCTGCGGTAGGAGGTTAT 
AGAGATAAAACTGCATTAGCCATTGGTGTCGGCTCACGCATTACTGATCGCTTTACCGC 
TAAAGCGGGTGTAGCGTTCAATACCTACAATGGCGGCATGTCTTATGGTGCTTCTGTTG 
GTTATGAATTCTAATCATTACGTTTAATCACTAATCGTTTTGGTTATAATAAAAAGGCT 
AAATGTTTCTCCTCACATTTAGCCTTTCTTATTTATCTTTGTTATAGCTTTTGCTGTTATA 
AAACCGl i 1 1 1 1 AGCCACTTTTATTAATTA AGCTTTTAAGCCTATTCAATCAGTTCTACT 
ttcac l ■ nrn CACCATATTATCCGCCACTTCTA A AACGGTA ATATTAAGTTGGTTTAGC 
CTAAATTGGGTACCTTCTATCGGAATTTTTTCTAAATGTTCTAAAATTAA 

SEQ ID NO: 14 PROTEIN SEQUENCE OF V-1157 (set forth in FIG. 9) 

MKIKCLVA W GLACSTITTM AQQPPKFAGV SSLYSYEYDY GKGKWTWSNE 
GGFDIKVPG1 KMKPKEWISK QATYLELQHY MPYTPVLVTS APDVPPSSIL 
LYPMSDPDQL GINRQQLKLN LYSYFNDLRH DFKLKVLDAR ISKNKQNIDT 
ISKYLLELGT YLDGSYRMME QNTHNINKNT HNINKNTHNI NKLSKELQTG 
LANQSALSML VQPNGVGKTS VSAAVGGYRD KTALAIGVGS RITDRFTAKA 
GVAFNTYNGG MSYGASVGYE F 



SEQ ID NO: 15 DNA SEQUENCE OF M90-02 

TTTTATAATTTACAATACATTTTATATTTTTATATTATATAAATACCGTCATTGACATTT 

TTTTAATGTAAGGTAGAATAAGAAAGTAAATTCTATATTTACAATCAAGATTGACAATT 

ATTTACTTAATGAGGTGATTATGAAAATTAAATGTTTAGTTGCCGTAGTGGGATTAGCT 

TGTTCTACTATTACAACAATGGCTCAGCAGCCGCCAAAGTTTGCTGGAGTATCTTCTTT 

GTATAGCTATGAGTATGACTATGGTAAGGGTAAATGGACTTGGTCTAATGAAGGCGGT 

TTCGATATTAAAGTGCCAGGGATTAAAATGAAGCCAAAAGAATGGATTTCTAAACAGG 

CTACTTATCTTGAATTACAGCATTATATGCCTTATACTCCTGTTCTCGTGACATCTGCTC 

CTGACGTTTCTCCTAGCTCTATCTCTATACTGTTATATCCGATGTCTGATCCTGATCAAC 

TTGGAATAAATCGGCAGCAGCTGAAATTGAATTTGTATAGTTATTTTAACGATTTAAGA 

CACGATTTTAAATTAAAAGTTCTTGATGCACGTATTTCCAAAAATAAACAAAATATTGA 

TACTATAAGTAAATATTTACTAGAACTGGGTACTTATTTAGATGGTTCTTATCGTATGA 

TGGAACAAAATACACATAATATCAATAAAAATACACATAATATCAATAAAAATACACA 

TAATATCAATAAGTTGTCTAAAGAATTGCAAACTGGTTTAGCCAACCAATCAGCATTGT 

CTATGTTAGTGCAACCAAATGGTGTAGGCAAAACGAGCGTTTCTGCTGCGGTAGGAGG 

TTATAGAGATAAAACTGCATTAGCCATTGGTGTCGGCTCACGCATTACTGATCGCTTTA 

CCGCTAAAGCGGGTGTAGCGTTCAATACCTACAATGGCGGCATGTCTTATGGTGCTTCT 
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GTTGGTTATGAATTCTAATCATTACGTTTAATCACTAATCGTTTTGGTTATAATAAAAA 
GGCTAAATGTTTCTCCTCACATTTAGCCTTTTCTTATTTATCTTT 

SEQ ID NO: 16 PROTEIN SEQUENCE OF M90-02 (set forth in FIG. 9) 

MKIKCLVAVV GLACSTITTM AQQPPKFAGV SSLYSYEYDY GKGKWTWSNE 
GGFDIKVPGI KMKPKEWISK QATYLELQHY MPYTPVLVTS APDVSPSSIS 
ILLYPMSDPD QLGINRQQLK LNLYSYFNDL RHDFKLKVLD ARISKNKQNI 
DTISKYLLEL GTYLDGSYRM MEQNTHNINK NTHNINKNTH NINKLSKELQ 
TGLANQSALS MLVQPNGVGK TSVSAAVGGY RDKTALAIGV GSPJTDRFTA 
KAGVAFNTYN GGMSYGASVG YEF 



SEQ ID NO: 17 DNA SEQUENCE OF 406 

ATTTTATAATTTACAATACATTTTTATT^ 

TTAATGTAAGGTAGAATAAGAAAGTAAATTCTATATTTACAATCAAGATTGACAATTA 

TTTACTTAATGAGGTGATTATGAAAATTAAATGTTTAGTTGCCGTAGTGGGATTAGC1T 

GTTCTACTATTACAACAATGGCTCAGCAGCCGCCAAAGTTTGCTGGAGTATCTTCTTTG 

TATAGCTATGAGTATGACTATGGTAAGGGTAAATGGACTTGGTCTAATGAAGGCGGTT 

TCGATATTAAAGTGCCAGGGATTAAAATGAAGCCAAAAGAATGGATTTCTAAACAGGC 

TACTTATCTTGAATTACAGCATTATATGCCTTATACTCCTGTTCTCGTGACATATGCTCC 

TGGCGTTTCTCCTAGCCCTATACTGTTATATCCGATGTCTGATCCTGATCAACTTGGAAT 

AAATCGGCAGCAGCTGAAATTGAATTTGTATAGTTATTTTAACGATTTAAGACACGATT 

TTAAATTAAAAGTTCTTGATGCACGTATTTCCAAAAATAAACAAAATATTGATACTATA 

AGTAAATATTTACTAGAACTGGGTACTTATTTAGATGATTCTTATCGTATGATGGAACA 

AAATACACATAATATCAATAAGTTGTCTAAAGAATTGCAAACTGGnTAGCCAACCAA 

TCAGCATTGTCTATGTTAGTGCAACCAAATGGTGTAGGCAAAACGAGCGTTTCTGCTGC 

GGTAGGAGGTTATAGAGATAAAACTGCATTAGCCATTGGTGTCGGCTCACGCATTACT 

GATCGCTTTACCGCTAAAGCGGGTGTAGCGTTCAATACCTACAATGGCGGCATGTCTTA 

TGGTGCTTCTGTTGGTTATGAATTCTAATCATTACGTTTAATCACTAATCGTTTTGGTTA 

TAATAAAAAGGCTAAATGTTTCTCCTCACATTTAGCCTTTCTTATTTATCTTTGTTATAG 

CTTTTGCTGTTATAAAACCGTTTTTTAGCCACTTTTATTAATTAAGCTTTTAAGCCTATT 

CAATCAGTTCTACTTTCACTTTTTTCACCATATTATCCGCCACTTCTAAAACGGTAATAT 

TAAGTTGGTTTAGCCTAAATTGGGTACCTTCTATCGGAATTTTTTCTAAATGTTCTAAA 

ATTAAG 

SEQ ID NO: 18 PROTEIN SEQUENCE OF 406 (set forth in FIG. 9) 

MKIKCLVAVV GLACSTITTM AQQPPKFAGV SSLYSYEYDY GKGKWTWSNE 
GGFDIKVPGI KMKPKEWISK QATYLELQHY MPYTPVLVTY APGVSPSPIL 
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ISKYLLELGT YLDDSYRMME QNTHNINKLS KELQTGLANQ SALSMLVQPN 
GVGKTSVSAA VGGYRDKTAL AIGVGSRITD RFTAKAGVAF NTYNGGMSYG 
ASVGYEF 
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THAT WHICH IS CLAIMED IS: 

1 . An isolated polynucleotide encoding DsrA, the polynucleotide selected 
from the group consisting of: 

(a) DNA having the nucleotide sequence of SEQ ID NO:l; 

(b) DNA having the nucleotide sequence selected from the group consisting 
of SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID 
NO:ll, SEQ D3 NO:13, SEQ ID NO:15, and SEQ ID NO:17; 

(c) polynucleotides that hybridize to DNA of (a) or (b) above under 
U stringent conditions and which encode DsrA; and 

O ( d ) polynucleotides that differ from the DNA of (a) or (b) or (c) above due 

to the degeneracy of the genetic code, and that encode DsrA encoded by a DNA of 
Ul (a) or (b) above. 

ru 

JU 2. An isolated polynucleotide according to Claim 1 that encodes DsrA. 

Q 

J 3 . An isolated polynucleotide that encodes DsrA, wherein the DsrA has the 

z amino acid sequence given herein as SEQ ID NO:2. 

4. An isolated polynucleotide that encodes DsrA, wherein the DsrA has and 
amino acid sequence selected from the group of SEQ ID NO:4, SEQ ID NO:6, 
SEQ ID NO:8, SEQ D3 NO: 10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID 
NO:16, and SEQ ID NO:18. 

5. An isolated polynucleotide according to Claim 1 which is a DNA having the 
nucleotide sequence given herein as SEQ ID NO:l. 

6. An isolated protein encoded by a polynucleotide according to Claim 1 . 

7. An isolated and purified protein having the amino acid sequence selected 
from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ D3 NO:6, SEQ 
ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, and 
SEQIDNO:18. 
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8. An expression vector comprising a polynucleotide according to Claim 1 . 

9. A cell containing an expression vector according to Claim 8. 

1 0. A cell containing an expression vector according to Claim 8 and capable of 
expressing DsrA. 

11. An antibody that specifically binds to a protein encoded by a 
polynucleotide according to Claim 1. 

12. An antibody according to Claim 1 1 , wherein said antibody is a polyclonal 
antibody. 

13. An antibody according to Claim 1 1, wherein said antibody is a monoclonal 
antibody. 

14. An antisense oligonucleotide complementary to a polynucleotide of Claim 
1 and having a length sufficient to hybridize thereto under physiological 
conditions. 

15. A DNA encoding an antisense oligonucleotide of Claim 1 4. 

16. An expression vector comprising an antisense oligonucleotide according to 
Claim 14. 

17. A method for producing a protein comprising the amino acid sequence 
selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ 
ID NO:16, and SEQ ID NO:18, or a fragment thereof, comprising 

(a) culturing a host cell containing an expression vector containing at least 
a fragment of a polynucleotide sequence encoding DsrA under conditions suitable 
for the expression of the protein; and 

(b) recovering the protein from the host cell culture. 
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1 8. A method for detecting a polynucleotide which encodes DsrA in a 
biological sample comprising: 

(a) hybridizing the complement of the polynucleotide sequence which 
encodes a polynucleotide selected from the group consisting of SEQ ID NO:l, 
SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, 
SEQ ID NO:13, SEQ ED NO:15, and SEQ D> NO:17 to nucleic acid material of a 
biological sample, thereby forming a hybridization complex; and 

u 0>) detecting the hybridization complex, wherein the presence of the 

complex correlates with the presence of a polynucleotide encoding DsrA in the 

yj biological sample. 

O 

Ul 

fU 1 9. The mutant H. ducreyi strain FX5 1 7, wherein the mutant does not encode 

7 or express DsrA. 

O 
jjj 

U 20. A vaccine composition comprising purified protein DsrA or a fragment 

q thereof in a pharmaceutically acceptable carrier. 



Jy 



21. A vaccine composition of Claim 20 further comprising another outer 
membrane protein of H. ducreyi. 

22. A vaccine according to Claim 20 further comprising an adjuvant 

23 . A vaccine composition comprising a polynucleotide of Claim 1 in a 
pharmaceutically acceptable carrier. 

24. A vaccine composition according to Claim 34 wherein the polynucleotide 
has the sequence SEQ ID NO:l. 

25. A vaccine composition comprising an expression vector of Claim 8 in a 
pharmaceutically acceptable carrier. 
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26. A vaccine composition comprising the H. ducreyi mutant FX5 1 7 in a 
pharmaceutical^ acceptable carrier. 

27. A DNA vaccine comprising an attenuated H. ducreyi strain. 

28. A method for inducing a protective immune response in a subject at risk of 
developing H. ducreyi infection comprising administering to the subject a vaccine 
according to one of Claims 20-27 in an amount sufficient to induce an immune 
response. 
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U = Unlabeled OMP 

1 = Surface-labeled H. ducreyi total protein 

2 = Affinity purification, human native Vn 

3 = Affinity purification, human recombinant Vn 

4 = Affinity purification, bovine native Vn 
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Page 2 of 3 



Send correspondence to: 
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Direct telephone calls to: 



F. Michael Sajovec 
(919) 854-1400 

(919) 854-1401 




Residence: 
Citizenship: 
Mailing Address: 



ChapemihV^orth Carolina 

United States jsl . C , 

2216 Ridgewood Road 

Chapel Hill, North Carolina 27516 
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Attorney's Docket No. 5470-269 



PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



ATTN: DO/EO/US 



In re: Christopher Elkins 
Serial No.: to be assigned 
Filed: concurrently herewith 

For: DSRA PROTEIN AND POL YNUCLEOTIDES ENCODING THE SAME 



Date: January 9, 2002 



Box PCT 

Commissioner for Patents 
Washington, DC 20231 



STATEMENT IN SUPPORT OF FILING A 
SEQUENCE LISTING UNDER 37 CFR § 1.821(f) 



Sir: 



I hereby state that the content of the paper and computer readable copies of the 
Sequence listing, submitted concurrently herewith in accordance with 37 CFR § 1.821(c) and 
(e), are the same. 



Express Mail Label No. EV015665109US 
Date of Deposit: January 9, 2002 

I hereby certify that this correspondence is being deposited with the United States Postal Service "Express Mail 
Post Office to Addressee" service under 37 CFR 1.10 on the date indicated above and is addressed to: BOX PCT, 
Attn: DO/EO/US, Commissioner for Patents, Washington, DC 20231. 




Respectfully submitted, 



'll I . 'I III 1 1 1 1< 
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PATENT TRADEMARK OFFICE 



CERTIFICATE OF EXPRESS MAILING 




Monica L. Croom 



10/030529 

AmBrxmr^o 09 jan 2002 



SEQUENCE LISTING 

<110> University of North Carolina-Chapel Hill 
Elkins, Christopher 

<120> Isolated Polynucleotides Encoding DsrA, A Protein Conferring 
Serum Resistance To H. ducreyi, And Methods And Compositions Comprising 
The Same 

<130> 5470-269. WO 
<160> 18 

<170> Patentln version 3.1 

<210> 1 

<211> 1168 

<212> DNA 

<213> Haemophilus ducreyi 

<400> 1 



ataaatacgt 


cattgacatt 


tttttaatgt 


aaggtagaat 


aagaaagtaa 


attctatatt 


60 


tacaatcaag 1 


attgacaatt 


atttacttaa 


tgaggtgatt 


atgaaaatta 


aatgtttagt 


120 


tgccgtag^tg' 


ggattagctt 




tacaacaatg gctcagcagc 


cgccaaagtt 


180 


fccjc fccjcj'acj'fc.a 








ggtaagggta 


aatggacttg 


240 


cffcctaatcjaa 


ggcggtttcg 




gccagggatt 


aaaatgaagc 


caaaagaatg 


300 


gatttctaaa 






acagcattat 


atgccttata 


ctcctgttct 


360 


cgtgacatat 


gctcctggcg 


tttctcctag 


ccctatactg 


ttatatccga tgtctgatcc 


420 


tgatcaactt 


ggaataaatc 


ggcagcagct 


gaaattgaat 


ttgtatagtt 


attttaacga 


480 


tttaagacac 


gattttaaat 


taaaagttct 


tgatgcacgt 


atttccaaaa 


ataaacaaaa 


540 


tattgatact 


ataagtaaat 


atttactaga 


actgggtact 


tatttagatg 


attcttatcg 


600 


tatgatggaa 


caaaatacac 


ataatatcaa 


taagttgtct 


aaagaattgc 


aaactggttt 


660 


agccaaccaa 


tcagcattgt 


ctatgttagt 


gcaaccaaat 


ggtgtaggca 


aaacgagcgt 


720 


ttctgctgcg 


gtaggaggtt 


atagagataa 


aactgcatta 


gccattggtg 


tcggctcacg 


780 


cattactgat 


cgctttaccg 


ctaaagcggg 


tgtagcgttc 


aatacctaca 


atggcggcat 


840 


gtcttatggt 


gcttctgttg 


gttatgaatt 


ctaatcatta 


cgtttaatca 


ctaatcgttt 


900 


tggttataat 


aaaaaggcta 


aatgtttctc 


ctcacattta 


gcctttctta 


tttatctttg 


960 


ttatagcttt 


tgctgttata 


aaaccgtttt 


ttagccactt 


ttattaatta 


agcttttaag 


1020 


cctattcaat 


cagttctact 


ttcacttttt 


tcaccatatt 


atccgccact 


tctaaaacgg 


1080 


taatattaag 


ttggtttagc 


ctaaattggg 


taccttctat 


cggaattttt 


tctaaatgtt 


1140 


ctaaaattaa 


gccgttaaag 


gtgcggac 








1168 



1 



<210> 2 
<211> 257 
<212> PRT 

<213> Haemophilus ducreyi 
<400> 2 

Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 
15 10 15 

lie Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 
20 25 30 

Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 
M, 35 40 45 

=T| Asn Glu Gly Gly Phe Asp lie Lys Val Pro Gly lie Lys Met Lys Pro 

50 55 60 

Lys Glu Trp lie Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 
65 70 75 80 

o 

yl Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala Pro Gly Val Ser Pro 

Q 85 90 95 

Ser Pro lie Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly lie 
" iU 100 105 110 



Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 
115 120 125 



Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg lie Ser Lys Asn 
130 135 140 



Lys Gin Asn lie Asp Thr lie Ser Lys Tyr Leu Leu Glu Leu Gly Thr 
145 150 155 160 



Tyr Leu Asp Asp Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn lie 
165 170 175 



Asn Lys Leu Ser Lys Glu Leu Gin Thr Gly Leu Ala Asn Gin Ser Ala 
180 185 190 



Leu Ser Met Leu Val Gin Pro Asn Gly Val Gly Lys Thr Ser Val Ser 
195 200 205 



Ala Ala Val Gly Gly Tyr Arg Asp Lys Thr Ala Leu Ala He Gly Val 



2 



210 



215 



220 



Gly Ser Arg lie Thr Asp Arg Phe Thr Ala Lys Ala Gly Val Ala Phe 
225 230 235 240 

Asm Thr Tyr Asn Gly Gly Met Ser Tyr Gly Ala Ser Val Gly Tyr Glu 
245 250 255 

Phe 



<210> 3 
<211> 1205 
^ <212> DNA 

;~" <213> Haemophilus ducreyi 

CJ <400> 3 

W attttataat ttacaataca ttttatattt ttatattata taaatacgtc attgacattt 6 0 

01 ttttaaggta gaataagaaa gtaaattcta tatttacaat caagattgac aattatttac 12 0 

ru 

yg ttaatgaggt gattatgaaa attaaatgtt tagttgccgt agtgggatta gcttgttcta 18 0 

f=? ctattacaac aatggctcag cagccgccaa agtttgctgg agtatcttct ttgtatagct 24 0 

W 

rj atgagtatga ctatggtaag ggtaaatgga cttggtctaa tgaaggcggt ttcgatatta 300 

M 

aagtgccagg gattaaaatg aagccaaaag aatggatttc taaacaggct acttatcttg 360 

s 

hJ aattacagca ttatatgcct tatactcctg ttctcgtgac atatgctcat gacgttcctc 42 0 

ctagctctat actgttatat ccgatgtctg atcctgatca acttggaata aatcggcagc 48 0 

agctgaaatt gaatttgtat agttatttta acgatttaag acacgatttt aaattaaaag 54 0 

ttcttgatgc acgtatttcc aaaaataaac aaaatattga fcactataagt aaatatttac 600 

tagaactggg tacttattta gatgattctt atcgtatgat ggaacaaaat acacataata 66 0 

tcaataaaaa tacacataat atcaataagt tgtctaaaga attgcaaact ggtttagcca 72 0 

accaatcagc attgtctatg ttagtgcaac caaatggtgt aggcaaaacg agcgtttctg 78 0 

ctgcggtagg aggttataga gataaaactg cattagccat tggtgtcggc tcacgcatta 840 

ctgatcgctt taccgctaaa gcgggtgtag cgttcaatac ctacaatggc ggcatgtctt 900 

atggtgcttc tgttggttat gaattctaat cattacgttt aatcactaat cgttttggtt 960 

ataataaaaa ggctaaatgt ttctcctcac atttagcctt tcttatttat ctttgttata 102 0 

gcttttgctg ttataaaacc gttttttagc cacttttatt aattaagctt ttaagcctat 108 0 

tcaatcagtt ctactttcac ttttttcacc atattatccg ccacttctaa aacggtaata 1140 

ttaagttggt ttagcctaaa ttgggtacct tctatcggaa ttttttctaa atgttctaaa 1200 



3 



attaa 



1205 



in 

Q 
in 
□ 



<210> 4 
<211> 264 
<212> PRT 

<213> Haemophilus ducreyi 
<400> 4 

Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 
15 10 15 

He Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 
20 25 30 

Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 
35 40 45 

Asn Glu Gly Gly Phe Asp He Lys Val Pro Gly He Lys Met Lys Pro 
50 55 60 

Lys Glu Trp He Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 
65 70 75 80 

Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala His Asp Val Pro Pro 



Ser Ser He Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly He 
100 105 . 110 



Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 
115 120 125 



Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg lie Ser Lys Asn 
130 135 140 



Lys Gin Asn He Asp Thr He Ser Lys Tyr Leu Leu Glu Leu Gly Thr 
145 150 155 160 



Tyr Leu Asp Asp Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn He 
165 170 175 



Asn Lys Asn Thr His Asn He Asn Lys Leu Ser Lys Glu Leu Gin Thr 
180 185 190 



Gly Leu Ala Asn Gin Ser Ala Leu Ser Met Leu Val Gin Pro Asn Gly 
195 200 205 



4 



Val Gly Lys Thr Ser Val Ser Ala Ala Val Gly Gly Tyr Arg Asp Lys 
210 215 220 

Thr Ala Leu Ala lie Gly Val Gly Ser Arg lie Thr Asp Arg Phe Thr 
225 230 235 240 

Ala Lys Ala Gly Val Ala Phe Asn Thr Tyr Asn Gly Gly Met Ser Tyr 
245 250 255 

Gly Ala Ser Val Gly Tyr Glu Phe 

260 





<210> 5 
<211> 952 
<212> DKTA 

<213> Haemophilus ducreyi 










3 

Ul 


<400> 5 
attttataat 


ttacaataca 


ttttatattt 


ttatattata 


taaatacgtc 


attgacattt 


60 




ttttaaggta 


gaataagaaa 


gtaaattcta 


tatttacaat 


caagattgac 


aattatttac 




o 


ttaatgaggt 


gattatgaaa 


attaaatgtt 


tagttgccgt 


agtgggatta 


gcttgttcta 




s 


ctattacaac 


aatggctcag 


cagccgccaa 


agtttgctgg 


agtatcttct 


ttgtatagct 


240 




atgagtatga 


ctatggtaag 


ggtaaatgga 


cttggtctaa 


tgaaggcggt 


ttcgatatta 


300 


ru 


aagtgccagg 


gattaaaatg 


aagccaaaag 


aatggatttc 


taaacaggct 


acttatcttg 


360 




aattacagca 


ttatatgcct 


tatactcctg 


ttctcgtgac 


atatgctcat 


gacgttcctc 


420 




ctagctctat 


actgttatat 


ccgatgtctg 


atcctgatca 


acttggaata 


aatcggcagc 


480 




agctgaaatt 


gaatttgtat 


agttatttta 


acgatttaag 


acacgatttt 


aaattaaaag 


540 




ttcttgatgc 


acgtatttcc 


aaaaataaac 


aaaatattga 


tactataagt 


aaatatttac 


600 




tagaactggg 


tacttattta 


gatgattctt 


atcgtatgat 


ggaacaaaat 


acacataata 


660 




tcaataaaaa 


tacacataat 


atcaataagt 


tgtctaaaga 


attgcaaact 


ggtttagcca 


720 




accaatcagc 


attgtctatg 


ttagtgcaac 


caaatggtgt 


aggcaaaacg 


agcgtttctg 


780 




ctgcggtagg 


aggttataga 


gataaaactg 


cattagccat 


tggtgtcggc 


tcacgcatta 


840 




ctgatcgctt 


taccgctaaa 


gcgggtgtag 


cgttcaatac 


ctacaatggc 


ggcatgtctt 


900 




atggtgcttc 


tgttggttat 


gaattctaat 


cattacgttt 


aatcactaat 


eg 


952 



<210> 6 

<211> 264 

<212> PRT 

<213> Haemophilus ducreyi 



5 



<400> 6 



Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 
15 10 15 



lie Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 



Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 



Asn Glu Gly Gly Phe Asp lie Lys Val Pro Gly lie Lys Met Lys Pro 



M= Lys Glu Trp lie Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 



Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala His Asp Val Pro Pro 



Ser Ser lie Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly lie 
100 105 110 



Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 
115 120 125 



Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg lie Ser Lys Asn 
130 135 140 



Lys Gin Asn lie Asp Thr lie Ser Lys Tyr Leu Leu Glu Leu Gly Thr 
145 150 155 160 



Tyr Leu Asp Asp Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn lie 
165 170 175 



Asn Lys Asn Thr His Asn lie Asn Lys Leu Ser Lys Glu Leu Gin Thr 
180 185 190 



Gly Leu Ala Asn Gin Ser Ala Leu Ser Met Leu Val Gin Pro Asn Gly 
195 200 205 



Val Gly Lys Thr Ser Val Ser Ala Ala Val Gly Gly Tyr Arg Asp Lys 
210 215 220 



Thr Ala Leu Ala lie Gly Val Gly Ser Arg lie Thr Asp Arg Phe Thr 
225 230 235 240 



6 



i 



Ala Lys Ala Gly Val Ala Phe Asn Thr Tyr Asn Gly Gly Met Ser Tyr 
245 250 255 



Gly Ala Ser Val Gly Tyr Glu Phe 
260 



<210> 7 
<211> 899 
<212> DNA 

<213> Haemophilus ducreyi 










ttttataatt 


tacaatacat 


tttatatttt 


tatattatat 


aaatacgtca 


ttgacatttt 


60 


tttaatgtaa 


ggtagaataa 


gaaagtaaat 


tctatattta 


caatcaagat 


tgacaattat 


120 


ttacttaatg 


aggtgattat 


gaaaattaaa 


tgtttagttg 


ccgtagtggg 


attagcttgt 


180 


tctactatta 


caacaatggc 


tcagcagccg 


ccaaagtttg 


ctggagtatc 


ttctttgtat 


240 


agctatgagt 


atgactatgg 


taagggtaaa 


tggacttggt 


ctaatgaagg 


cggtttcgat 


300 


attaaagtgc 


cagggattaa 


aatgaagcca 


aaagaatgga 


tttctaaaca 


ggctacttat 


360 


cttgaattac 


agcattatat 


gccttatact 


cctgttctcg 


tgacatatgc 


tcctggcgtt 


420 


tctcctagcc 


ctatactgtt 


atatccgatg 


tctgatcctg atcaacttgg 


aataaatcgg 


480 


cagcagctga 


aattgaattt 


gtatagttat 


tttaacgatt 


taagacacga 


ttttaaatta 


540 


aaagttcttg 


atgcacgtat 


ttccaaaaat 


aaacaaaata 


ttgatactat 


aagtaaatat 


600 


ttactagaac 


tgggtactta 


tttagatgat 


tcttatcgta 


tgatggaaca 


aaatacacat 


660 


aatatcaata 


agttgtctaa 


agaattgcaa 


actggtttag 


ccaaccaatc 


agcattgtct 


720 


atgttagtgc 


aaccaaatgg 


tgtaggcaaa 


acgagcgttt 


ctgctgcggt 


aggaggttat 


780 


agagataaaa 


ctgcattagc 


cattggtgtc 


ggctcacgca 


ttactgatcg 


ctttaccgct 


840 


aaagcgggtg 


tagcgttcaa 


taccttctat 


cggaattttt 


tctaaatgtt 


ctaaaatta 


899 



<210> 8 

<211> 242 

<212> PRT 

<213> Haemophilus ducreyi 

<400> 8 

Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 
1 5 10 15 

lie Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 



7 



Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 
35 40 45 



Asn Glu Gly Gly Phe Asp lie Lys Val Pro Gly lie Lys Met Lys Pro 



Lys Glu Trp lie Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 



Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala Pro Gly Val Ser Pro 



Ser Pro lie Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly lie 
100 105 ~ 110 



Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 
115 120 ' 125 



Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg lie Ser Lys Asn 
130 135 140 



Lys Gin Asn lie Asp Thr lie Ser Lys Tyr Leu Leu Glu Leu Gly Thr 
145 150 155 160 



Tyr Leu Asp Asp Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn lie 
165 170 175 



Asn Lys Leu Ser Lys Glu Leu Gin Thr Gly Leu Ala Asn Gin Ser Ala 
180 185 190 



Leu Ser Met Leu Val Gin Pro Asn Gly Val Gly Lys Thr Ser Val Ser 
195 200 205 



Ala Ala Val Gly Gly Tyr Arg Asp Lys Thr Ala Leu Ala lie Gly Val 
210 215 220 



Gly Ser Arg lie Thr Asp Arg Phe Thr Ala Lys Ala Gly Val Ala Phe 
225 230 235 240 



<210> 9 

<211> 1197 

<212> DNA 

<213> Haemophilus ducreyi 



<400> 9 

aatggccatt ttataattta caatacattt tatattttta tattatataa atacgtcatt 60 

gacatttttt taatgtaagg tagaataaga aagtaaattc tatatttaca atcaagattg 12 0 

acaattattt acttaatgag gtgattatga aaattaaatg tttagttgcc gtagtgggat 18 0 

tagcttgttc tactattaca acaatggctc agcagccgcc aaagtttgct ggagtatctt 24 0 

ctttgtatag ctatgagtat gactatggta agggtaaatg gacttggtct aatgaaggcg 300 

gtttcgatat taaagtgcca gggattaaaa tgaagccaaa agaatggatt tctaaacagg 360 

ctacttatct tgaattacag cattatatgc cttatactcc tgttctcgtg acatatgctc 420 

ctggcgtttc tcctagccct atactgttat atccgatgtc tgatcctgat caacttggaa 480 

taaatcggca gcagctgaaa ttgaatttgt atagttattt taacgattta agacacgatt 54 0 

Q ttaaattaaa agttcttgat gcacgtattt ccaaaaataa acaaaatatt gatactataa 60 0 

yj gtaaatattt actagaactg ggtacttatt tagatgattc ttatcgtatg atggaacaaa 660 

fjn atacacataa tatcaataag ttgtctaaag aattgcaaac tggtttagcc aaccaatcag 72 0 

fU 

Ifj cattgtctat gttagtgcaa ccaaatggtg taggcaaaac gagcgtttct gctgcggtag 780 

gaggttatag agataaaact gcattagcca ttggtgtcgg ctcacgcatt actgatcgct 84 0 

yi ttaccgctaa agcgggtgta gcgttcaata cctacaatgg cggcatgtct tatggtgctt 90 0 

CH ctgttggtta tgaattctaa tcattacgtt taatcactaa tcgttttggt tataataaaa 96 0 

fU aggctaaatg tttctcctca catttagcct ttcttattta tctttgttat agccttttgc 1020 

tgttataaaa ccgtttttta gccactttta ttaattaagc ttttaagcct attcaatcag 1080 

ttctactttc acttttttca ccatattatc cgccacttct aaaacggtaa tattaagttg 114 0 

gtttagccta aattgggtac cttctatcgg aattttttct aaatgttcta aaattaa 1197 

<210> 10 
<211> 257 
<212> PRT 

<213> Haemophilus ducreyi 
<400> 10 

Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 



lie Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 
20 25 30 

Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 



9 



Asn Glu Gly Gly Phe Asp lie Lys Val Pro Gly lie Lys Met Lys Pro 
50 55 60 

Lys Glu Trp lie Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 
55 70 75 80 

Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala Pro Gly Val Ser Pro 



Ser Pro lie Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly lie 
100 105 no 



Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 
115 120 125 



Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg lie Ser Lys Asn 

130 135 140 



Lys Gin Asn lie Asp Thr lie Ser Lys Tyr Leu Leu Glu Leu Gly Thr 
145 150 155 160 



Tyr Leu Asp Asp Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn lie 
165 170 175 



Asn Lys Leu Ser Lys Glu Leu Gin Thr Gly Leu Ala Asn Gin Ser Ala 
180 185 190 



Leu Ser Met Leu Val Gin Pro Asn Gly Val Gly Lys Thr Ser Val Ser 
195 200 205 



Ala Ala Val Gly Gly Tyr Arg Asp Lys Thr Ala Leu Ala lie Gly Val 
210 215 220 



Gly Ser Arg He Thr Asp Arg Phe Thr Ala Lys Ala Gly Val Ala Phe 
225 230 235 ~ 240 



Asn Thr Tyr Asn Gly Gly Met Ser Tyr Gly Ala Ser Val Gly Tyr Glu 
245 250 255 



<210> 11 

<211> 923 

<212> DNA 

<213> Haemophilus ducreyi 



10 



<400> 11 

tatttacaat caagattgac aattatttac ttaatgaggt gattatgaaa attaaatgtt 60 

tagttgccgt agtgggatta gcttgttcta ctattacaac aatggctcag cagccgccaa 12 0 

agtttgctgg agtatcttct ttggatagct atgagtatga ctatggtaag ggtaaatgga 180 

cttggtctga aaaagacggt ttcgatatta aagcgccagg gattaaaatg aagccaaaaa 240 

aatggatttc tagacaggct acttatcttg gattacagca ttatatgcct tatactcctg 300 

ttctcgtgac atatgcttct gcagaaccta acactgtact gttatatccg atgcctgatc 360 

ctgatcaact tggaataaat cggcagcagc tgaaattgaa tttgtatagt tattttaacg 420 

atttaagaca cggttttaaa ttaaatgttc ttgatgcacg tatttcccaa aataaacaaa 480 

atattgatac tataagtgaa tatttactaa aactgggtac ttatttagat agttcttatc 540 

gtatgatgga acaaaataca cataatatca ataaaaatac acataatatc aataagttgt 600 

ctaaagaatt gcaaactggt ttagccaacc aatcagcatt gtctatgtta gtgcaaccaa 660 

atggtgtagg caaaacgagc gtttctgctg cggtaggagg ttatagagat aaaactgcat 720 

tagccattgg tgtcggctca cgcattactg atcgctttac cgctaaagcg ggtgtagcgt 780 

tcaataccta caatggcggc atgtcttatg gtgcttctgt tggttatgaa ttctaatcat 840 

tacgtttaat cactaatcgt tttggttata ataaaaaggc taaatgtttc tcctcacatt 900 

tagcctttct tatttatctt tgt 923 

<210> 12 
<211> 263 
<212> PRT 

<213> Haemophilus ducreyi 
<400> 12 

Met Lys He Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 



He Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 

20 25 30 

Leu Asp Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 

35 40 45 

Glu Lys Asp Gly Phe Asp He Lys Ala Pro Gly He Lys Met Lys Pro 

50 55 60 

Lys Lys Trp He Ser Arg Gin Ala Thr Tyr Leu Gly Leu Gin His Tyr 



11 



Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala Ser Ala Glu Pro Asn 
85 90 95 



Thr Val Leu Leu Tyr Pro Met Pro Asp Pro Asp Gin Leu Gly lie Asn 
100 105 110 

Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu Arg 
115 120 * 12 5 

His Gly Phe Lys Leu Asn Val Leu Asp Ala Arg lie Ser Gin Asn Lys 
130 135 140 

Gin Asn lie Asp Thr lie Ser Glu Tyr Leu Leu Lys Leu Gly Thr Tyr 
145 150 155 160 

p Leu Asp Ser Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn He Asn 

yj 165 170 175 

O 

ry L y s Asn Thr His Asn He Asn Lys Leu Ser Lys Glu Leu Gin Thr Gly 

g 180 185 * 190 

U Leu Ala Asn Gin Ser Ala Leu Ser Met Leu Val Gin Pro Asn Gly Val 

Ut 195 200 205 

o 
m 

O GlY Lys Thr Ser Val Ser Ala Ala Val Gly Gly Tyr Arg Asp Lys Thr 

ffj 210 215 220 

Ala Leu Ala He Gly Val Gly Ser Arg He Thr Asp Arg Phe Thr Ala 
225 230 235 240 

Lys Ala Gly Val Ala Phe Asn Thr Tyr Asn Gly Gly Met Ser Tyr Gly 
245 250 255 

Ala Ser Val Gly Tyr Glu Phe 
260 

<210> 13 
<211> 1231 
<212> DNA 

<213> Haemophilus ducreyi 
<400> 13 

cttttataat ttacaataca ttttatattt ttatattata taaatacgtc attgacattt 6 0 

ttttaatgta aggtagaata agaaagtaaa ttctatattt acaatcaaga ttgacaatta 120 
tttacttaat gaggtgatta tgaaaattaa atgtttagtt gccgtagtgg gattagcttg 180 
ttctactatt acaacaatgg ctcagcagcc gccaaagttt gctggagtat cttctttgta 240 
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tagctatgag 


tatgactatg 


gtaagggtaa 




tctaatgaag 


gcggtttcga 




tattaaagtg 


ccagggatta 






atttctaaac 


aggctactta 




tcttgaatta 


cagcattata 


tgccttatac 




gtgacatctg 


ctcctgacgt 




t cc tcctagc 


tctatactgt 






gatcaacttg gaataaatcg 




gcagcagctg 






tt ttaacgat 


ttaagacacg 


attttaaatt 


540 


aaaagttctt 


gatgcacgta 






attgatacta 


taagtaaata 




tttactagaa 








atgatggaac 


aaaatacaca 




taatatcaat 


aaaaatacac 




taaaaataca 


cataatatca 


ataagttgtc 




taaagaattg 






atcagcattg 


tctatgttag 


tgcaaccaaa 


780 


tggtgtaggc 




tttctgctgc 


ggtaggaggt 


tatagagata 


aaactgcatt 


840 


agccattggt 






tcgctttacc gctaaagcgg gtgtagcgtt 






aatggcggca 


tgtcttatgg 


tgcttctgtt 


ggttatgaat 


tctaatcatt 


960 


acgtttaatc 


actaatcgtt 


ttggttataa 


taaaaaggct 


aaatgtttct 


cctcacattt 


1020 


agcctttctt 


atttatcttt 


gttatagctt 


ttgctgttat 


aaaaccgttt 


tttagccact 


1080 


tttattaatt 


aagcttttaa 


gcctattcaa 


tcagttctac 


tttcactttt 


ttcaccatat 


1140 


tatccgccac 


ttctaaaacg 


gtaatattaa 


gttggtttag 


cctaaattgg 


gtaccttcta 


1200 


tcggaatttt 


ttctaaatgt 


tctaaaatta 


a 






1231 



<210> 14 
<211> 271 
<212> PRT 

<213> Haemophilus ducreyi 
<400> 14 

Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 
15 10 15 

lie Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 
20 25 30 

Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 
35 40 45 

Asn Glu Gly Gly Phe Asp He Lys Val Pro Gly He Lys Met Lys Pro 
50 55 60 

Lys Glu Trp He Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 
65 70 75 80 



13 



Met Pro Tyr Thr Pro Val Leu Val Thr Ser Ala Pro Asp Val Pro Pro 
85 90 95 



Ser Ser lie Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly He 
100 105 110 



Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 
115 120 " 125 



Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg He Ser Lys Asn 
130 135 140 



Lys Gin Asn He Asp Thr He Ser Lys Tyr Leu Leu Glu Leu Gly Thr 
145 150 155 160 



Tyr Leu Asp Gly Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn He 
165 170 175 



Asn Lys Asn Thr His Asn He Asn Lys Asn Thr His Asn He Asn Lys 
180 185 190 



Leu Ser Lys Glu Leu Gin Thr Gly Leu Ala Asn Gin Ser Ala Leu Ser 
195 200 205 



Met Leu Val Gin Pro Asn Gly Val Gly Lys Thr Ser Val Ser Ala Ala 
210 215 220 



Val Gly Gly Tyr Arg Asp Lys Thr Ala Leu Ala He Gly Val Gly Ser 
225 230 235 240 



Arg He Thr Asp Arg Phe Thr Ala Lys Ala Gly Val Ala Phe Asn Thr 
245 250 255 



Tyr Asn Gly Gly Met Ser Tyr Gly Ala Ser Val Gly Tyr Glu Phe 
260 265 270 



<210> 15 
<211> 1047 
<212> DNA 

<213> Haemophilus ducreyi 
<400> 15 

ttttataatt tacaatacat tttatatttt tatattatat aaataccgtc attgacattt 
ttttaatgta aggtagaata agaaagtaaa ttctatattt acaatcaaga ttgacaatta 
tttacttaat gaggtgatta tgaaaattaa atgtttagtt gccgtagtgg gattagcttg 
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acaacaatgg ctcagcagcc gccaaagttt gctggagtat cttctttgta 240 

tatgactatg gtaagggtaa atggacttgg tctaatgaag gcggtttcga 300 

ccagggatta aaatgaagcc aaaagaatgg atttctaaac aggctactta 36 0 

cagcattata tgccttatac tcctgttctc gtgacatctg ctcctgacgt 42 0 

tctatctcta tactgttata tccgatgtct gatcctgatc aacttggaat 480 

cagctgaaat tgaatttgta tagttatttt aacgatttaa gacacgattt 540 

gttcttgatg cacgtatttc caaaaataaa caaaatattg atactataag 600 

ctagaactgg gtacttattt agatggttct tatcgtatga tggaacaaaa 660 

atcaataaaa atacacataa tatcaataaa aatacacata atatcaataa 720 

gaattgcaaa ctggtttagc caaccaatca gcattgtcta tgttagtgca 780 

gtaggcaaaa cgagcgtttc tgctgcggta ggaggttata gagataaaac 84 0 

attggtgtcg gctcacgcat tactgatcgc tttaccgcta aagcgggtgt 90 0 

acctacaatg gcggcatgtc ttatggtgct tctgttggtt atgaattcta 96 0 

ttaatcacta atcgttttgg ttataataaa aaggctaaat gtttctcctc 102 0 

ttttcttatt tatcttt 1047 

<213> Haemophilus ducreyi 
<400> 16 

Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 
1 5 10 15 



lie Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 
20 25 30 



Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 
35 40 45 



Asn Glu Gly Gly Phe Asp lie Lys Val Pro Gly lie Lys Met Lys Pro 
50 55 60 



Lys Glu Trp He Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 
65 70 75 80 



Met Pro Tyr Thr Pro Val Leu Val Thr Ser Ala Pro Asp Val Ser Pro 
85 90 95 



ttctactatt 
tagctatgag 
tattaaagtg 
tcttgaatta 
ttctcctagc 
aaatcggcag 
taaattaaaa 
taaatattta 
tacacataat 
Q gttgtctaaa 
sT; accaaatggt 
rS tgcattagcc 
agcgttcaat 
atcattacgt 
acatttagcc 



ill 



Q 
Rj 



<210> 16 
<211> 273 
<212> PRT 
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Ser Ser lie Ser lie Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu 
100 105 110 



Gly lie Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn 
115 120 125 



Asp Leu Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg lie Ser 
130 135 140 



Lys Asn Lys Gin Asn lie Asp Thr lie Ser Lys Tyr Leu Leu Glu Leu 
145 150 155 160 



Gly Thr Tyr Leu Asp Gly Ser Tyr Arg Met Met Glu Gin Asn Thr His 
165 170 175 



Asn lie Asn Lys Asn Thr His Asn lie Asn Lys Asn Thr His Asn lie 
180 185 190 



Asn Lys Leu Ser Lys Glu Leu Gin Thr Gly Leu Ala Asn Gin Ser Ala 
195 200 205 



Leu Ser Met Leu Val Gin Pro Asn Gly Val Gly Lys Thr Ser Val Ser 
210 215 220 



Ala Ala Val Gly Gly Tyr Arg Asp Lys Thr Ala Leu Ala lie Gly Val 
225 230 235 240 



Gly Ser Arg lie Thr Asp Arg Phe Thr Ala Lys Ala Gly Val Ala Phe 
245 250 255 



Asn Thr Tyr Asn Gly Gly Met Ser Tyr Gly Ala Ser Val Gly Tyr Glu 
260 265 270 



<210> 17 
<211> 1189 
<212> DNA 

<213> Haemophilus ducreyi 
<400> 17 

attttataat ttacaataca tttttatttt tatattatat aaatacgtca ttgacatttt 60 
tttaatgtaa ggtagaataa gaaagtaaat tctatattta caatcaagat tgacaattat 12 0 
ttacttaatg aggtgattat gaaaattaaa tgtttagttg ccgtagtggg attagcttgt 18 0 
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caacaatggc tcagcagccg ccaaagtttg ctggagtatc ttctttgtat 240 

atgactatgg taagggtaaa tggacttggt ctaatgaagg cggtttcgat 3 00 

cagggattaa aatgaagcca aaagaatgga tttctaaaca ggctacttat 360 

agcattatat gccttatact cctgttctcg tgacatatgc tcctggcgtt 42 0 

ctatactgtt atatccgatg tctgatcctg atcaacttgg aataaatcgg 480 

aattgaattt gtatagttat tttaacgatt taagacacga ttttaaatta 540 

atgcacgtat ttccaaaaat aaacaaaata ttgatactat aagtaaatat 6 00 

tgggtactta tttagatgat tcttatcgta tgatggaaca aaatacacat 660 

agttgtctaa agaattgcaa actggtttag ccaaccaatc agcattgtct 72 0 

aaccaaatgg tgtaggcaaa acgagcgttt ctgctgcggt aggaggttat 780 

ctgcattagc cattggtgtc ggctcacgca ttactgatcg ctttaccgct 840 

tagcgttcaa tacctacaat ggcggcatgt cttatggtgc ttctgttggt 900 

aatcattacg tttaatcact aatcgttttg gttataataa aaaggctaaa 960 

cacatttagc ctttcttatt tatctttgtt atagcttttg ctgttataaa 102 0 

agccactttt attaattaag cttttaagcc tattcaatca gttctacttt 1080 

accatattat ccgccacttc taaaacggta atattaagtt ggtttagcct 114 0 

ccttctatcg gaattttttc taaatgttct aaaattaag 118 9 

<213> Haemophilus ducreyi 
<400> 18 

Met Lys lie Lys Cys Leu Val Ala Val Val Gly Leu Ala Cys Ser Thr 
1 5 10 15 



lie Thr Thr Met Ala Gin Gin Pro Pro Lys Phe Ala Gly Val Ser Ser 
20 25 30 



Leu Tyr Ser Tyr Glu Tyr Asp Tyr Gly Lys Gly Lys Trp Thr Trp Ser 
35 40 45 



Asn Glu Gly Gly Phe Asp lie Lys Val Pro Gly lie Lys Met Lys Pro 
50 55 60 



Lys Glu Trp lie Ser Lys Gin Ala Thr Tyr Leu Glu Leu Gin His Tyr 
65 70 75 80 



tctactatta 
agctatgagt 
attaaagtgc 
cttgaattac 
tctcctagcc 
cagcagctga 
aaagttcttg 
ttactagaac 
Li aatatcaata 
p atgttagtgc 
S agagataaaa 
|j aaagcgggtg 
tatgaattct 
Q tgtttctcct 

m 

□ accgtttttt 

pi 

pi cacttttttc 

fU 

*" aaattgggta 

<210> 18 
<211> 257 
<212> PRT 
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Met Pro Tyr Thr Pro Val Leu Val Thr Tyr Ala Pro Gly Val Ser Pro 
85 90 95 



Ser Pro lie Leu Leu Tyr Pro Met Ser Asp Pro Asp Gin Leu Gly lie 
100 105 110 



Asn Arg Gin Gin Leu Lys Leu Asn Leu Tyr Ser Tyr Phe Asn Asp Leu 
115 120 125 



Arg His Asp Phe Lys Leu Lys Val Leu Asp Ala Arg lie Ser Lys Asn 
130 135 140 



Lys Gin Asn lie Asp Thr lie Ser Lys Tyr Leu Leu Glu Leu Gly Thr 
145 150 155 160 



Tyr Leu Asp Asp Ser Tyr Arg Met Met Glu Gin Asn Thr His Asn lie 
165 170 175 



Asn Lys Leu Ser Lys Glu Leu Gin Thr Gly Leu Ala Asn Gin Ser Ala 
180 185 190 



Leu Ser Met Leu Val Gin Pro Asn Gly Val Gly Lys Thr Ser Val Ser 
195 200 205 



Ala Ala Val Gly Gly Tyr Arg Asp Lys Thr Ala Leu Ala lie Gly Val 
210 215 220 



Gly Ser Arg He Thr Asp Arg Phe Thr Ala Lys Ala Gly Val Ala Phe 
225 230 235 240 



Asn Thr Tyr Asn Gly Gly Met Ser Tyr Gly Ala Ser Val Gly Tyr Glu 
245 250 255 
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